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Abstract 

We consider slow-fast systems of differential equations, in which both the slow and fast 
variables are perturbed by noise. When the deterministic system admits a uniformly 
asymptotically stable slow manifold, we show that the sample paths of the stochastic 
I system are concentrated in a neighbourhood of the slow manifold, which we construct 

p I ■ explicitly. Depending on the dynamics of the reduced system, the results cover time 

spans which can be exponentially long in the noise intensity squared (that is, up to 
Kramers' time). We obtain exponentially small upper and lower bounds on the prob- 
ability of exceptional paths. If the slow manifold contains bifurcation points, we show 
similar concentration properties for the fast variables corresponding to non-bifurcating 
modes. We also give conditions under which the system can be approximated by a 
■ lower-dimensional one, in which the fast variables contain only bifurcating modes. 
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! 1 Introduction 

Systems involving two well-separated timescales are often described by slow-fast differen- 
tial equations of the form 

H , = f{x,y,e), 

y = g[x,y,e), 

where e is a small parameter. Since x can be much larger than y, x is called the fast variable 
and y is called the slow variable. Such equations occur, for instance, in climatology, with 
the slow variables describing the state of the oceans, and the fast variables the state of 
the atmosphere. In physics, slow-fast equations model in particular systems containing 
heavy particles (e.g. nuclei) and light particles (e.g. electrons). Another example, taken 
from ecology, would be the dynamics of a predator-prey system in which the rates of 
reproduction of predator and prey are very different. 

The system (1.1) behaves singularly in the limit e — > 0. In fact, the results depend 
on the way this limit is performed. If we simply set e to zero in (1.1), we obtain the 
algebraic-differential system 

= f{x,y,0), 

y = 5(2;, y,o). 
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Assume there exists a difFerentiable manifold with equation x = x*{y) on which / = 0. 
Then x = x*{y) is called a slow manifold, and the dynamics on it is described by the 
reduced equation 

y = g{x''{y),y,0). (1.3) 

Another way to analyze the limit £ —> is to scale time by a factor 1/e, so that the 
slow-fast system (1.1) becomes 

x' = f{x,y,e), 

y' = £g{x,y,£) 

In the limit £ — ^ 0, we obtain the so-called associated system 

x' = f{x,y,0), 

y' = o, 

in which y plays the role of a parameter. The slow manifold x = x*(y) consists of equi- 
librium points of (1.5), and (1.4) can be viewed as a perturbation of (1.5) with slowly 
drifting parameter y. 

Under certain conditions, both the reduced equation (1.3) and the associated sys- 
tem (1.5) give good approximations of the initial slow-fast system (1.1), but on different 
timescales. Assume for instance that for each y, x*{y) is an asymptotically stable equilib- 
rium of the associated system (1.5). Then solutions of (1.1) starting in a neighbourhood 
of the slow manifold will approach x*{y) in a time of order e|loge|. During this time 
interval they are well approximated by solutions of (1.5). This first phase of the motion is 
sometimes called the boundary-layer hehaYiour. For larger times, solutions of (1.1) remain 
in an ^-neighbourhood of the slow manifold, and are thus well approximated by solutions 
of the reduced equation (1.3). This result was first proved by Gradstein [17] and Tihonov 
[28]. 

Fenichel [13] has given results allowing for a geometrical description of these phenomena 
in terms of invariant manifolds. He showed, in particular, the existence of an invariant 
manifold 

x = x{y,e), with x{y,e) = x*{y) + 0{e), (1.6) 

for sufficiently small e, whenever x*{y) is a family of hyperbolic equilibria of the associated 
system (1.5). The dynamics on this invariant manifold is given by the equation 

y = g{x{y,£),y,e), (1.7) 

which can be treated by methods of regular perturbation theory, and reduces to (1.3) in 
the limit e — > 0. In fact, Fenichel's results are more general. For instance, if x*{y) is a 
saddle, they also show the existence of invariant manifolds associated with the stable and 
unstable manifolds of x*{y). See [19] for a review. 

New, interesting phenomena arise when the dynamics of (1.7) causes y to approach a 
bifurcation point of (1.5). For instance, the passage through a saddle-node bifurcation, 
corresponding to a fold of the slow manifold, produces a jump to some other region in 
phase space, which can cause relaxation oscillations and hysteresis phenomena (see in 
particular [26] and [18], as well as [23] for an overview). Transcritical and pitchfork 
bifurcations generically lead to a smoother transition to another equilibrium [22, 21], 
while the passage through a Hopf bifurcation is accompanied by the delayed appearance 
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(1.4) 



(1.5) 



of oscillations [24, 25]. There exist many more recent studies of what has become known 
as the field of dynamic bifurcations, sec for instance [4]. 

In many situations, low-dimensional ordinary differential equations of the form x = 
f{x) are not sufficient to describe the dynamics of the system under study. The effect of 
unknown degrees of freedom is often modelled by noise, leading to a stochastic differential 
equation (SDE) of the form 

dxt = fixt)dt + aF{xt)dWt, (1.8) 

where cr is a small parameter, and Wt denotes a standard, generally vector-valued Brown- 
ian motion. On short timescales, the main effect of the noise term aF{xt) dWt is to cause 
solutions to fluctuate around their deterministic counterpart, but the probability of large 
deviations is very small (of the order e"*^""**/*^ ). On longer timescales, however, the noise 
term can induce transitions to other regions of phase space. 

The best understood situation is the one where / admits an asymptotically stable 
equilibrium point x* . The first-exit time t{uj) of the sample path xtioj) from a neighbour- 
hood of X* is a random variable, the characterization of which is the object of the exit 
problem. If / derives from a potential U (i. e., / = —VU) of which x* is a local minimum, 
the asymptotic behaviour of the typical first-exit time for a <C 1 has been long known by 
physicists: it is of order e^^/^ , where H is the height of the lowest potential barrier sep- 
arating X* from other potential wells. A theory of large deviations generalizing this result 
to quite a large class of SDEs has been developed by Freidlin and Wentzell [16]. More 
detailed information on the asymptotics of the expected first-exit time has been obtained, 
see [2, 14] and the very precise results by Bovier, Eckhoff, Gayrard and Klein [8, 9] on the 
relation between the expected first-exit time, capacities and the spectrum of the generator 
of the diffusion. The distribution of r has been studied by Day [11]. 

The more difficult problem of the dynamics near a saddle point has been considered in 
[20] and in [12]. The situation where / depends on a parameter and undergoes bifurcations 
has not yet been studied in that much detail. An approach based on the notion of random 
attractors [27, 1, 10] gives information on the limit t oo, when the system has reached 
a stationary state. Note, however, that the time needed to reach this regime, in which (in 
the gradient case) xt is most likely to be found near the deepest potential well, may be 
very long if the wells are separated by barriers substantially higher than a^. The dynamics 
on intermediate timescales, known as the metastable regime, is not yet well understood in 
the presence of bifurcations. 

In this work, we are interested in the effect of noise on slow-fast systems of the 
form (1.1). Such systems have been studied before in [15], using techniques from large 
deviation theory to describe the limit o" ^ 0. Here we use different methods to give a 
more precise description of the regime of small, but finite noise intensity, our main goal 
being to estimate quantitatively the noise-induced spreading of typical paths, as well as 
the probability of exceptional paths. We will consider situations in which both the slow 
and fast variables are affected by noise, with noise intensities taking into account the 
difference between the timescales. In (1.8), the diffusive nature of the Brownian motion 
causes paths to spread like a\/i. In the case of the slow-fast system (1.1), we shall choose 
the following scaling of the noise intensities: 

dxt = ^f{xt, yt, e) dt + -^F{xt, yt, e) dWt, 
dyt = g{xt, yt, e) dt + a'G{xt, yt, s) dWt. 
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In this way, cr^ and (cr')^ both measure the ratio between the rate of diffusion squared 
and the speed of drift, respectively, for the fast and slow variable. We consider general 
finite-dimensional x G M" and y € M™", while Wt denotes a /c-dimensional standard Brow- 
nian motion. Accordingly, F and G are matrix-valued functions of respective dimensions 
n X k and mx k. The matrices F{x, y, e) will be assumed to satisfy some (rather weak) 
nondegeneracy condition, sec Remark 2.3. We consider e, a and a' as small parameters, 
and think of a and a' as functions of e. We limit the analysis to situations where a' 
does not dominate a, i. e., we assume a' = pa where p may depend on e but is uniformly 
bounded above in e. 

We first consider the case where the deterministic slow fast system (1.1) admits an 
asymptotically stable slow manifold x*{y). Our first main result. Theorem 2.4, states that 
the sample paths of (1.9) are concentrated in a "layer" surrounding the adiabatic manifold 
x{y,e), of the form 

B{h) = {ix,y): {ix-xiy,e)),X{y,e)-\x - xiy,e))) < h^} (1.10) 

up to time t, with a probability behaving roughly like 1 — (t^/e) e~'*^/^°'^ as long as the 
paths do not reach the vicinity of a bifurcation point. The matrix X{y,e), defining the 
ellipsoidal cross-section of the layer, is itself a solution of a slow-fast system, and depends 
only on the values of F and dxf on the slow manifold. In particular, X{y, 0) is a solution 
of the Lyapunov equation 

A*{y)X + XA'iyf + F(x*(y), y, 0)F(x'^(y), y, 0)^ = 0, (1.11) 

where A*{y) = dxf{x*{y),y,0). For instance, if / derives from a potential U, —A* is the 
Hessian matrix of U at its minimum, and i3{h) is more elongated in those directions in 
which the curvature of U is smallest. 

Theorem 2.6 gives a more detailed description of the dynamics inside B{h), by show- 
ing that paths {xt,yt) are concentrated in a neighbourhood of the deterministic solution 
{xf'^^ ,yf'^^) at least up to times of order 1. The spreading in the y-direction grows at a 
rate corresponding to the finite-time Lyapunov exponents of the deterministic solution. 

Next we turn to situations where the deterministic solution approaches a bifurcation 
point of the associated system. In this case, the adiabatic manifold x{y, e) is not defined 
in general. However, by splitting x into a stable direction x~ and a bifurcating direction 
z, one can define a (centre) manifold x~ = x~ {z, y, e) which is locally invariant under the 
deterministic flow. Theorem 2.8 shows that paths of the stochastic system are concentrated 
in a neighbourhood of x~{z, y, e). The size of this neighbourhood again depends on noise 
and linearized drift term in the stable x~ -direction. 

In order to make use of previous results on the passage through bifurcation points for 
one-dimensional fast variables, such as [7, 5, 6], it is necessary to control the deviation 
between solutions of the full system (1.9), and the reduced stochastic system obtained 
by setting xr equal to x~{z,y,e). Theorem 2.9 provides such an estimate under certain 
assumptions on the dynamics of the reduced system. 

We present the detailed results in Section 2, Subsection 2.2 containing a summary of 
results on deterministic slow-fast systems, while Subsection 2.3 is dedicated to the random 
case with a stable slow manifold and Subsection 2.4 to the case of bifurcations. Sections 3 
to 5 contain the proofs of these results. 
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2 Results 

2.1 Preliminaries 

Let V be an open subset of M" x M"* and Eq > a constant. We consider slow-fast 
stochastic differential equations of the form 



with drift coefficients / G C'^{V x [0,eo),IR") and g G C'^{V x [0, eo), 1^"*), and diffusion 
coefficients F eC^{V x [0,eo),M"^*^) and G x [0,£o),K'"^'=). 

We require that /, and all their derivatives up to order 2 are uniformly bounded in 
norm in P x [0, eg)) and similarly for F, G and their derivatives. We also assume that / 
and g satisfy the usual (local) Lipschitz and bounded-growth conditions which guarantee 
existence and pathwise uniqueness of a strong solution {{xt^ytY^f^to of (2.1). 

The stochastic process {Wt}t^o is a standard /c-dimensional Brownian motion on some 
probability space (O,.?^, P), and stochastic integrals with respect to {VFtjf^o arc to be 
understood as Ito integrals. Initial conditions (,to,?/o) are always assumed to be squarc- 
integrable with respect to P and independent of {VFtjt^o- Our assumptions on / and g 
guarantee the existence of a continuous version of {{xt.,yt)}t^o- Therefore we may assume 
that the paths u i— > {xt{u)^yt{ijj)) are continuous for P-almost all w G 

We introduce the notation ^*o,(xo,yo) ^j^g j^w of the process {{xt,yt)^t'^tm starting 
in (a;o,yo) at time to, and use E*°'(^°'^°) to denote expectations with respect to p*0'(^0'fo). 
Note that the stochastic process {{xt.,yt)}t^to is a time-homogeneous Markov process. Let 
^ C P be Borel-measurable. Assuming (xq, yo) € .4, we denote by 



the first-exit time of {xt,yt) from A. Note that r4 is a stopping time with respect to the 
filtration of (r2,JF, P) generated by the Brownian motion {Wt}t^o. 
Throughout this work, we use the following notations: 

• Let a, b be real numbers. We denote by [a], a A 6 and a V 6, respectively, the smallest 
integer greater than or equal to a, the minimum of a and 6, and the maximum of a 
and b. 

• By g{u) = 0{u) we indicate that there exist 6 > and K > such that g{u) ^ Ku 
for all u e [0,S], where S and K of course do not depend on e, a or cr'. 

• We use ||x|| to denote the Euclidean norm of x G M*^ and (•, •) for the associated inner 
product. For a matrix A G R'^^^'^^, we denote by ||^|| the corresponding operator 
norm. If A[t) is a matrix-valued function defined for t in an interval I, we denote by 
11^11/ the supremum of ||^(i)|| over t e I, and often we write ||4||oo if the interval is 
evident from the context. 

• We write A'^ for the transposed of a matrix, and Tr A for the trace of a square matrix. 



dxt = -f{xt, yt, s) dt + —;=F{xt, yu e) dWt, 
e v£ 

dyt = g{xt, yt, s) dt + a'G{xt, yt, e) dWt, 



(2.1) 



r^ = inf{t ^0: ixt,yt) ^ A} 



(2.2) 



5 



• For a given set B, we denote by 1b the indicator function on B, defined by Ib{x) = 1; 
if a; € B, and 1_b(x) = 0, otherwise. 

• If M" X M™ 9 {x,y) ^ f{x,y) G M'^ is differentiablc, wc write dxf{x,y) and dyf{x,y) 
to denote the Jacobian matrices oi x ^ f{x,y) and y i-^ f{x,y), respectively. 

2.2 Deterministic stable case 

We start by recalhng a few properties of deterministic slow-fast systems of the form 

ex = f{x,y,s), 

y = 9{x,y,£)- 

Definition 2.1. Let C M"* and assume that there exists a (continuous) function 
X* : 2?o ^ such that 

• {x*{y), y) for all y G Vq, 

• f{x%y),y,0)=0 for ally gVq. 

Then the set {{x,y): x = x*{y),y G Vq} is called a slow manifold of the system (2.3). 
Let A*{y) = dxf{x*{y),y,0). The slow manifold is called 

• hyperbolic if all eigenvalues of A*{y) have nonzero real parts for all y EVq; 

• uniformly hyperbolic if all eigenvalues of A*{y) have real parts uniformly bounded 
away from zero (for y £ Vq ); 

• asymptotically stable if all eigenvalues of A*{y) have negative real parts for all y & Vq; 

• uniformly asymptotically stable if all eigenvalues of A*{y) have negative real parts, 
uniformly bounded away from zero for y E Vq. 

Gradstein [17] and Tihonov [28] have shown that if x* represents a uniformly hyperbolic 
slow manifold of (2.3), then the system (2.3) admits particular solutions which remain 
in a neighbourhood of order e of the slow manifold. If, moreover, the slow manifold is 
asymptotically stable, then the solutions starting in a neighbourhood of order 1 of the slow 
manifold converge exponentially fast in t/e to an e-neighbourhood of the slow manifold. 

Fenichel [13] has given extensions of this result based on a geometrical approach. 
If (2.3) admits a hyperbolic slow manifold, then there exists, for sufficiently small e, an 
invariant manifold 

X = x{y, e) = x*{y) + 0{e), y G Pq- (2.4) 

Here invariant means that if yo G Vq and xq = x{yo,e), then xt = x{yt,e) as long as t is 
such that ys G Vq for all s ^ t. We will call the set {{x{y,£),y): y G Vq} an adiabatic 
manifold. It is easy to see from (2.3) that x{y, e) must satisfy the PDE 

£dyx{y, £)g{x{y, e), y, e) = f{x{y, e), y, e). (2.5) 

The local existence of the adiabatic manifold follows directly from the centre manifold 
theorem. Indeed, we can rewrite System (2.3) in the form 

x' = f{x,y,£), 

y' = £g{x,y,£), (2.6) 
e' = 0, 

where prime denotes derivation with respect to the fast time t/e. Any point of the form 
{x*{y),y, 0) with y eVqis an equilibrium point of (2.6). The linearization of (2.6) around 
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such a point admits as eigenvalue of multiphcity m + 1, the n other eigenvalues being 
those of A*{y), which are bounded away from the imaginary axis. The centre manifold 
theorem implies the existence of a local invariant manifold x = x{y,£). Fenichel's result 
shows that this manifold actually exists for all y gVq. 

Being a centre manifold, the adiabatic manifold is not necessarily unique (though in 
the present case, x{y,0) = x*{y) is uniquely defined). Nevertheless, x{y,£) has a unique 
Taylor series in y and e, which can be obtained by solving (2.5) order by order. The 
dynamics on the adiabatic manifold is described by the so-called reduced equation 

y = g{x{y, s), y, e) = ff(x'^(y), y, 0) + 0{e). (2.7) 

If x*{y) is uniformly asymptotically stable, x{y,£) is locally attractive and thus any solu- 
tion of (2.3) starting sufficiently close to x{y,£) converges exponentially fast to a solution 
of (2.7). 



2.3 Random stable case 

We turn now to the random slow-fast system given by the stochastic differential equation 

dxt = -fixt,yt,£)dt+-^F{xt,yt,£)dWt, 

£ (2.8) 

dyt = g{xt, yt, e) dt + a'G{xt, yt, e) dWt, 
where we will assume the following. 

Assumption 2.2. For a = a' = 0, System (2.8) admits a uniformly hyperbolic, asymp- 
totically stable slow manifold x = x*{y), y £ Vq. 

By Fenichel's theorem, there exists an adiabatic manifold x = x{y,e) with x{y,0) = 
x*{y), y £ Vq. We fix a particular solution {xf'^^,yf^^) = {x{yf'^^,£),yf^^) of the deter- 
ministic system. (That is, y^^* satisfies the reduced equation (2.7).) We want to describe 
the noise-induced deviations of the sample paths {xt,yt)t^o of (2.8) from the adiabatic 
manifold. 

It turns out to be convenient to use the transformation 



xt = x{yf'' + r]t,e) + ^t, 



yt = yt' + 



(2.9) 



which yields a system of the form 



d6 = t, e) dt + -^F{^t, rit, t, e) dWt, ^ ^ 

e (2.10) 

dm = g{S.t, Vt,t, e) dt + a'G{^t,Vt, t, e) dWt, 
where the new drift and diffusion coefficients arc given by 

/(e, V, t, e) = f{x{yt' + v,e)+ yf * + V, e) 

- edyx{yt' + 7?, e)9{x{yt' + r/, e) + e, yt' + r},e)- £pVV(C, ??, t, e), 
F(e, r/, t, e) = F{x{yt' + r/, e) + 2/f * + e) 

- pV~edyx{yt' + V, e)G{x{yt' + 1 yt' + V, e), 
m r/, t, e) = g{x{yt' + r/, e) + yf + V,e)- g{x{yt\e),yt\e), 

G(e, rj, t, e) = G{x{yt' +V,e)+ C , yf + V,e). (2.11) 
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Here r{^,rj,t,e) stems from the contribution of the diffusion coefficients in (2.8) to the 
new drift coefficient, cf. Ito's formula. The Zth component of r{^,r),t,e) equals 

I TV {dyyxi{yf'^\ e)G{x{yf'' + v, e) + ^, yf^'+v, * + r/, e) + ^, yf^'+v, ef) , (2.12) 

where xi{y,£) denotes the Zth component of x{y,£), and dyyXi{y,e) the Hessian matrix of 
y ^ xi{y, e). In the sequel, we will only use the fact that each component of r(^, r), t, e) is 
at most of order m. 

Note that because of the property (2.5) of the adiabatic manifold, we have /(O, r], t, e) = 
—ep^a^r{0,r),t,e) = 0{mep'^a^). We introduce the notation 

A{yf'^\e) = d,f{x{yf-\e),yt\e) - edyx{yf'\e)d,g{x{yf-\e),yt\e) (2.13) 

as an approximation for the linearization of / at (0, 0,t,e), where we neglect the con- 
tribution of r(^,r/, t, e) to the linearization. Note that for e = 0, we have A{yf^^ ,0) = 
dxf{x{yf^^,0),yf^^,0) = A*{yf^^), so that by Assumption 2.2, the eigenvalues of A{yf^^,e) 
have negative real parts for sufficiently small e. 

One of the basic ideas of our approach is to compare the solutions of (2.10) with those 
of the "linear approximation" 

d^? = ]A{yf^\e)^^ dt + ^/o{yt\ e) dWu 
dyt'=g{x{yt\eUt\e)dt, 

where FQ{yf^^ ,e) = i^(0, 0, t, e). Note that the definition of the adiabatic manifold implies 
^o(?/;0) = F{x*{y),y,0). For fixed t, is a Gaussian random variable with covariance 
matrix 

Cov(e?) = - ['uit,s)Foiyf'\e)Foiy^'^\efU{t,sfds, (2.15) 
£ Jo 

where U{t, s) denotes the principal solution of the homogeneous system = A{yf'^^, e)^. 

We now observe that Cov(^^) is the X-variable of a particular solution of the 
deterministic slow-fast system 

eX = A{y, e)X + XA{y, + Fo{y, e)Fo{y, e)^, le) 
y = g{x(y,e),y,e). 

This system admits a slow manifold X = X*{y), given by the Lyapunov equation 

A*iy)X*{y) + X*iy)A*{yf + Fo(y, 0)Fo(y, 0)^ = 0, (2.17) 
which is known [3] to admit the (unique) solution 

X*{y)= e^^*(j')Fo(j/,0)Fo(y,0)^e^^*(s') ds. (2.18) 

^0 

Moreover, the eigenvalues of the operator X AX + XA^ are exactly Qi + aj, 1 ^ i, j ^ n, 
where are the eigenvalues of A. Thus the slow manifold X = X*{y) is uniformly 
asymptotically stable (for small enough e), so that Fenichel's theorem shows the existence 
of an adiabatic manifold 

X = X{y,e)=X*{y) + 0{e). (2.19) 
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Note that * , e) is uniquely determined by the "initial" value X(?/q^* , e) via the relation 

X{yt\e) = U{t) \xiy^'\s) + - f U{sr'Fo{yt\s)Fo{y^^ ,sfU{sr^ ds 

= Jo 



where Uit) = U{t,0) and U{s)-^ = [U{s)-Y- 
We now introduce the set 



U{tf, 
(2.20) 



B{h) = {{x,y):ye Vq, {{x - x{y, e)), X{y, e)-' {x - x{y, e))) Kh"}, (2.21) 

assuming that X{y,e) is invertible for all y G Vq. The set B{h) is a "layer" around the 
adiabatic manifold x = x{y,£), with ellipsoidal cross-section determined by X{y,e). For 
fixed t, the solution of the linear approximation (2.14) is concentrated (in density) in 
the cross-section of 13(a) taken at y^. Our first main result (Theorem 2.4 below) gives 
conditions under which the whole sample path {xt,yt) of the original equation (2.8) is 
likely to remain in such a set B{h). By 

TBih) = mf{t ^ : {xt, yt) B{h)} (2.22) 

we denote the first-exit time of the sample path (xt,yt) from B{h). In order to estimate 
the probability of Tg(/i) being small, we need to assume that X{y,e) and X{y,£)~^ are 
uniformly bounded in Vq, which excludes purely multiplicative noise. 

Remark 2.3. Fix y for the moment. If X*{y)~^ is bounded, then X(y,e)"^ is bounded 
for sufficiently small s. A sufficient condition for X*{y)~^ to be bounded is that the 
symmetric matrix Fo{y,0)FQ{y,0)'^ be positive definite. This condition is, however, by no 
means necessary. In fact, X*{y) is singular if and only if there exists a vector x ^ such 
that 

Fo{y, of e^^*(2/)^ Vs ^ 0, (2.23) 

which occurs if and only if 

x'^A^iyYFoiy, 0) = = 0, 1, 2, . . . (2.24) 

Because of the Cayley-Hamilton theorem, this relation holds for alH ^ provided it holds 
for Z = 0, . . . , n — 1. Conversely, X*{y) is nonsingular if and only if the matrix 

[Fo(y,0) A*iy)Fo{y,0) ... A*(y)"-iFo(y, 0)] G M^x"'^ (2.25) 

has full rank. This condition on the pair (yl*(?/), Fo(y, 0)) is known as controllability in 
control theory, where X*{y) is called a controllability Grammian. 

In what follows, we need (A*(y), i<o(y, 0)) to be controllable for all y G Pqj but in 
addition the smallest eigenvalue of X*{y) should be uniformly bounded away from zero. 

Theorem 2.4. Assume that \\X{y,e)\\ and ||X(y,e)~^|| are uniformly bounded in Vq. 
Choose a deterministic initial condition yo & Vq, xo = x{yQ,£), and let 

rvo=inf{s>0:ys^Vo}. (2.26) 

Then there exist constants eo, Ao,/io > (independent of the chosen initial condition yo) 
such that for all £ ^ £o; ^ ^ h ^ o,nd all < j < 1/2, the following assertions 
hold. 
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(a) The upper bound: For all t> Q, 



pO,(.o,.o){^^^^^ < t A J ^ C,n.,.,A(*> ^) (l + ^) e-'^"'^'/'^' , (2.27) 

where 

= 7[1 - O(^) - 0(A) - C»(m£p2) - o(e-™"^*/^ /(I - 27))] (2.28) 

and 

C+,^,,,a(*> ^) = [(1 - 27)-" + e"/^ + e-/^] . (2.29) 
(h) The lower bound: There exists to > of order 1 smc/i for all t > 0, 

pO,(.o,m,){^^(^^ < t} 5s C-^mit, e, h, a) e"'^"'*'/-^, (2.30) 



where 



and 



= i [1 + C)(/i) + 0(^Q-const{tAto)/e^^j (2.31) 



^n,mii^^^h,a) = const 



(2.32) 



(^cj General initial conditions: There exist 60 > and a time ti of order e\logh\ such 
that for all 6 ^ So, all initial conditions {xo,yo) which satisfy yo G Vq as well as 
{^0, X{yo, e)~^^o) < S^, and all t, t2 with t t2 ^ ti, 



dO,(«o,0) 



sup <6,X(y„£)-i6> > hA ^ Cl^,,,^{t,e)(l + ^) 

(2.33) 

where ^nm-y a(^^^) same prefactor as in (2.27), and 

= 7[1 - 0{h) - 0(A) - C)(mep2) _ const {t2Ai)/e _ 2^))] . (2.34) 

Unless explicitly stated, the error terms in the exponents k'^ and k~ are uniform in t, but 
they may depend on the dimensions n and m. 

Estimate (2.27) shows that for h^ a, paths starting in B{h) are far more likely to leave 
this set through the "border" {y G dT>o,{^,X{y,£)~^0 < h"^} than through the "sides" 
{y G mtVo,{^,X{y,e)~^^) = h"^}, unless we wait for time spans exponentially long in 
/i^/cj^. Below we discuss how to characterize tx>o more precisely, using information on the 
reduced dynamics on the adiabatic manifold. If, for instance, all deterministic solutions 
starting in Vq remain in this set, tdq will typically be very large. 

The upper bound (2.27) has been designed to yield the best possible exponent /c+, 
while the prefactor C^^^^ is certainly not optimal. Note that an estimate with the 
same exponent, but with a smaller prefactor holds for the probability that the endpoint 
{xt,yt) does not lie in B{h), cf. Corollary 3.10. The parameters A and 7 can be chosen 
arbitrarily within their intervals of definition. Taking A small and 7 close to 1/2 improves 
the exponent while increasing the prefactor. A convenient choice is to take A and 1/2 — 7 
of order h or e. The kind of time-dependence of Cj"^ ^ a is probably not optimal, but 
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the fact that ^ ^ increases with time is to be expected, since it reflects the fact that 
the probabihty of observing paths making excursions away from the adiabatic manifold 
increases with time. As for the dependence of the prcfactor on the dimensions n and m, it 
is due to the fact that the tails of standard Gaussian random variables show their typical 
decay only outside a ball of radius scaling with the square-root of the dimension. 

The upper bound (2.27) and lower bound (2.30) together show that the exponential 
rate of decay of the probability to leave the set B{h) before time t behaves like /i^/(2o"^) 
in the limit of a, e and h going to zero, as one would expect from other approaches, 
based for instance on the theory of large deviations. The bounds hold, however, in a full 
neighbourhood oi a = e = h = 0. 

Finally, Estimate (2.33) allows to extend these results to all initial conditions in a 
neighbourhood of order 1 of the adiabatic manifold. The only difference is that we have 
to wait for a time of order e|log/i.| before the path is likely to have reached the set B{h). 
After this time, typical paths behave as if they had started on the adiabatic manifold. 

Remark 2.5. In Theorem 2.4, the error terms in the exponents grow with the norms 
11/11 and ll^ll, and thus depend in general on the dimensions n and m. If the SDE (2.8) 
describes a large number of coupled similar subsystems (e. g. coupled oscillators), the error 
terms will not depend on the number of subsystems if, for instance, each one is coupled 
only to a finite number of neighbours. In mean-field type models, the error terms will be 
bounded if the interaction is properly scaled with the number of subsystems. 

The behaviour of typical paths depends essentially on the dynamics of the reduced 
deterministic system (2.7). In fact, in the proof of Theorem 2.4, we use the fact that yt 
does not differ too much from yf^^ on timcscales of order 1 (see Lemma 3.4). There are 
thus two main possibilities to be considered: 

• either the reduced flow is such that yf'^^ reaches the boundary of in a time of 
order 1 (for instance, yf^^ may approach a bifurcation set of the slow manifold); then 
yt is likely to leave Vq as well; 

• or the reduced flow is such that yf^^ remains in Pq for all times t ^ 0: in that case, 
paths can only leave B{h) due to the influence of noise, which we expect to be unlikely 
on subcxponcntial timcscales. 

We will discuss the first situation in more detail in Subsection 2.4. In both situations, 
it is desirable to have a more precise description of the deviation rft of the slow variable yt 
from its deterministic counterpart yf^^ , in order to achieve a better control of the first-exit 
time ■ 

The following coupled system gives a better approximation of the dynamics of (2.10) 
than the system (2.14): 

d^o = ^A(yf S £)e° dt + ^^Fo{yt\e) dWt, 
drft = [B{yt^,E)rft + C(yf \ e)^?] dt + a'G,{yt\ e) dWt, 
where Go(yt'^*,£) = G(0, 0, i,£) = G(x(y^®*, e), y^*^*, e) and the Jacobian matrices B and 
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C are given by 

= C{y'^'\e)dyx{yt^\e) + dyg{x{yt\e),yt'\e), (2.36) 
C{yt'\e) = dfg{d,Q,t,e) 

= d^g{x{yf-\e),yt\e). (2.37) 

The coupled system (2.35) can be written in compact form as 

dC° = A{yt\ ^)C° dt + aJ^o(yf S £) d^, (2.38) 

where (C°)^ = {{fY, {rfY) and 

A{yt\E)=[' ' , .^o(yf\6)=r^ . (2.39) 
The solution of the linear SDE (2.38) is given by 

0° = U{t)Co + (T fu{t, s)J'o (yf * , e) dWs , (2.40) 

Jo 

where U{t, s) denotes the principal solution of the homogeneous system ^ = A{yf'^^,e)(. 
It can be written in the form 

where U{t, s) and V{t, s) denote, respectively, the fundamental solutions of e.^ = A{yf'^^,e)$, 
and rj = B{yf^^,e)r], while 

S{t,s) = j\{t,u)C{yt^\e)U{u,s)du. (2.42) 

The Gaussian process has a covariance matrix of the form 

Cov(Ct°) = (7^ fu{t,s)Myf\e)Myf\^fm,sfds 

Jo 

_ 2p(i) Z{t)\ 

-'^ \Z{tf Y{t))- ^^-^^^ 

The matrices X{t) G R"''", Y{t) G R'^x™ and Z{t) G M"^"' are a particular solution of 
the following slow-fast system, which generalizes (2.16): 



(2.44) 



eX = A{y, e)X + XA{y, sf + Fo{y, e)Fo{y, sf, 
eZ = Aiy, e)Z + eZB{y, e)^ + eXC{y, ef + V^pFo{y, e)Go{y, , 
Y = B{y, e)Y + YB{y, + C(y, e)Z + Z^C{y, ef + p^Go{y, e)Go{y, ef, 
y = g{x{y,e),y,e). 

This system admits a slow manifold given by 

X = X*{y), 

Z = Z*{y, e) = -VipA{y, e)-'Fo{y, e)Go{y, ef + 0{s), (2.45) 
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where X*{y) is given by (2.18). It is straightforward to check that this manifold is uni- 
formly asymptotically stable for sufficiently small e, so that Fenichcl's theorem yields the 
existence of an adiabatic manifold X = X{y, e), Z = Z(y, e), at a distance of order e from 
the slow manifold. This manifold attracts nearby solutions of (2.44) exponentially fast, 
and thus asymptotically, the expectations of C?(C?)^ and C?(???)^ will be close, respectively, 
to a^Xiyf^'^e) and a^Ziyf^Ke). 

In general, the matrix Y{t) cannot be expected to approach some asymptotic value 
depending only on yf^^ and £. In fact, if the deterministic orbit yf^^ is repelling, ||y(t)|| 
can grow exponentially fast. In order to measure this growth, we introduce the functions 



X^'\t)= sup n sup \\V{s,v)\\)du, 
X^^\t)= sup n sup \\V{s,v)f)du. 

O^s^t Jo ^u^v^s ' 



(2.46) 
(2.47) 



The solution of (2.44) with initial condition 1^(0) = Iq satisfies 

Y{t-Y^) = V{i:)Y^V{tf (2.48) 
+ p2 r y(t, s)G^{yf\E)G^{yf\efV{t, sf ds + 0{{e + pVi)x^'^ (0)- 

We thus define an "asymptotic" covariance matrix Z{i) = Z{t;Yo,e) by 



Z{t;Yo,e)=i ^ 



X{y^'' 



t >< 



e) Z{yf^\e) 
Z(ydet^£)^ Y{t-Yo) 



(2.49) 



and use Z{t) ^ to characterize the ellipsoidal region in which C,{t) is concentrated. 



Theorem 2.6. Assume that \\X{y'^^^^e)\\ and \\X{yg^^ ,£)~^)\\ are uniformly hounded for 
^ s ^ t and that Iq has been chosen in such a way that ||l^(s)~^|| = C'(l/(p^ +£)) foT 
^ s ^ i. Fix an initial condition (xo,yo) with yo G Vq and xo = x{yo,e), and let t he 
such that y^^* G Pq for all s ^t. Define 



\z\ 



[o,t] 



1 + 



(i + r-^iiK:,)x«(o+x(^)(o 



There exist constants eo^^Oi^o > 0, independent ofYo, yo and t, such that 
pO,(o,o) 1 {C^^^z(u)-\u) > /i'l ^ C„+^,7,A(i,£)e-'^'*'/<^' 

holds, whenever e ^ £o; ^ ^ ^O; h ^ hoR{t)~^ and < 7 < 1/2. Here 



(2.50) 



(2.51) 



Cn+m,7,A(i,e) = const 



Ae 



1 - 27 



(n+m)/2 



+ e 



(n+m) /4 



K = 7 



i-o(£ + A + /iii:(i)) 



(2.52) 
(2.53) 



Let us first consider timescales of order 1. Then the functions ||2^||[o^t], X^^H^) 
X^'^\t) are at most of order 1, and ||y(t)~^|| remains of the same order as ||yQ~"''^||. The 
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probability (2.51) becomes small as soon as h ^ a. Because of the restriction h ^ 
hoR{t)''^, the result is useful provided ||^~^||[o,t] ^ cr~^. In order to obtain the optimal 
concentration result, we have to choose Yq according to two opposed criteria. On the one 
hand, we would like to choose Yq as small as possible, so that the set ((^uj ^{u)~^Cu) < h'^ 
is small. On the other hand, HI'q"^!! must not exceed certain bounds for Theorem 2.6 to 
be valid. Thus we require that 

Yo>[a''y{p' + e)\lm. (2.54) 

Because of the Gaussian decay of the probability (2.51) in cj//i, we can interpret the 
theorem by saying that the typical spreading of paths in the y-direction is of order a{p+y/e) 
if (7 < p + ^/£ and of order cr^ if cr > p + ^/e. 

The term p is clearly due to the intensity a' = pa of the noise acting on the slow 
variable. It prevails if /> > o" V ^/£. The term ^/£ is due to the linear part of the coupling 
between slow and fast variables, while the behaviour in cr^ observed when a > p-\- \fe can 
be traced back to the nonlinear coupling between slow and fast variables. 

For longer timescales, the condition h ^ hoR{t)~^ obliges us to take a larger Yq, while 
Y{t) typically grows with time. If the largest Lyapunov exponent of the deterministic 
orbit yf'^^ is positive, this growth is exponential in time, so that the spreading of paths 
along the adiabatic manifold will reach order 1 in a time of order logja V (p^ + e)|. 

Remark 2.7. Consider the reduced stochastic system 

dy? = g{x{yl e),yl e) dt + a'G{x{yl e),yl s) dWt (2.55) 

obtained by setting x equal to x{y,£) in (2.8). One may wonder whether j/° gives a better 
approximation of yt than yf^^ in the case cr' > 0. In fact, one can show that 

PW){ sup U-yr'\\>h^}^c(l + t)e-/'e.p{- "f\,..^J , 
P°'(°'°){ sup ||y.-yg|| ^4 ^c(l + ^)e-/^exp{- ,,,,'^f\,.,^J (2-56) 

+ + i) e™/" exp{- ^^^^^2^,2 + J^^^i + ^(2)(i)) } 

holds for all h, hi up to order x^^\t)~^ some positive constants c, ki,K2- (The proofs 
can be adapted from the proof of Lemma 3.4). This shows that the typical spreading 
of y° around yf®* is of order a' {I + x''^HO^''^) = P^(l + x'-^HO^^^)' while the typical 
deviation of paths of the reduced system from paths yt of the original system is of 
order ay/e{l + x^^H*)^^^)- Thus for p > -Je, the reduced stochastic system gives a better 
approximation of the dynamics than the deterministic one. 

If V{f) has no eigenvalues outside the unit circle, the spreading of paths will grow 
more slowly. As an important particular case, let us consider the situation where yf^* is 

an asymptotically stable periodic orbit with period T, entirely contained in Dq (and not 
too close to its boundary). Then all coefficients in (2.35) depend periodically on time, 
and, in particular, Floquet's theorem allows us to write 

V{t) = P{t) (2.57) 



14 



where P{t) is a T-periodic matrix. The asymptotic stabihty of the orbit means that all 
eigenvalues but one of the monodromy matrix A have strictly negative real parts, the last 
eigenvalue, which corresponds to translations along the orbit, being 0. In that case, X^^H^) 
and x^^\t) 

grow only linearly with time, so that the spreading of paths in the y-direction 
remains small on timescales of order l/(cr V (p^ + e)). 

In fact, we even expect this spreading to occur mainly along the periodic orbit, while 
the paths remain confined to a neighbourhood of the orbit on subcxponential timescales. 
To see that this is true, we can use a new set of variables in the neighbourhood of the 
orbit. In order not to introduce too many new notations, we will replace y by {y,z), 
where y G R"^~^ describes the degrees of freedom transversal to the orbit, and 2; G M 
parametrizes the motion along the orbit. In fact, we can use an equal-time parametrization 
of the orbit, so that i = 1 on the orbit, i. e., we have zf"^^ = t (mod T). The SDE takes 
the form 




dyt = g{xt, yt, zt, e) dt + a'G{xt, yt, zt, e) dWt, (2.58) 
dzt = [l + h{xt,yt,zt,s)] dt + a'H{xt,yt,zt,e)dWt, 

where h = 0(||yt|p + — and the Floquet multipliers associated with the periodic 

matrix dyg{xf^^ ,0, zf^^ , e) are strictly smaller than one in modulus. As linear approxima- 
tion of the dynamics of (^t, r)t) = {xt — xf^^,yt — yf^^) = {xt — xf^^, yt) we take 

de° = -A{zf'\e)^^ dt + ^Fo{zf'\e) dWt, 

dr/O = [B{zt-\e)4 + C(zf *, e)^?] dt + a'G^{zf-\e) dW^, (2.59) 

dz^ = dt + a'Ho{zf''\e) dWt, 



which depends periodically on time. One can again compute the covariance matrix of the 

Gaussian process {^t^rfl,Zi) as a function of the principal solutions U and V associated 
with A and B. In particular, the covariance matrix Y{t) of rfl still obeys the ODE 

Y = B{z'^^\e)Y + YB{z'^^\ef + C{z'^^\£)Z + Z'^C{z'^^\ef + p'^GQ{z^^\e)Go{z^^\e)'^ . 

(2.60) 

This is now a linear, inhomogeneous ODE with time-periodic coefficients. It is well known 
that such a system admits a unique periodic solution Y^^"^, which is of order + e since 
Z is of order p^/£ + e and p'^GqGq is of order p^. We can thus define an asymptotic 
covariance matrix Z{t) of {Ct'Vt)^ which depends periodically on time. If Q = {6.t,i]t)i 
Theorem 2.6 shows that on timescales of order 1 (at least), the paths Q are concentrated 
in a set of the form {Q, Z{t)~^Ct) < h^, while zt remains ^-close to zf^. 

On longer timescales, the distribution of paths will be smeared out along the periodic 
orbit. However, the same line of reasoning as in Section 3.2, based on a comparison with 
different deterministic solutions on successive time intervals of order 1, can be used to 
show that (t remains concentrated in the set {Q, Z{t)~^(t) < up to exponentially long 
timescales. 
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2.4 Bifurcations 



In the previous section, we have assumed that the slow manifold x = is uniformly 

asymptotically stable for y eVq. We consider now the situation arising when the reduced 
deterministic flow causes yf^^ to leave Vq, and to approach a bifurcation point of the slow 
manifold. 

We call a bifurcation point of the deterministic system 



if f{x, y,0) = and dxf{x, y, 0) has q eigenvalues on the imaginary axis, g G {1, . . . , n}. 
Wc consider here the situation where q < n and the other n — q eigenvalues have strictly 
negative real parts. 

The most generic cases are the saddle-node bifurcation (where q = 1), corresponding 
to a fold in the slow manifold, and the Hopf bifurcation (where q = 2), in which the slow 
manifold changes stability, while absorbing or expelling a family of periodic orbits. In these 
two cases, the set of bifurcation values y typically forms a codimension-1 submanifold of 



The dynamics of the deterministic slow-fast system (2.61) in a neighbourhood of the 
bifurcation point {x,y) can again be analyzed by a centre-manifold reduction. Introduce 
coordinates (x~,z) in M", with x~ G R""'^ and z G M'^, in which the matrix dxf{x,y,0) 
becomes block-diagonal, with a block A~ G R ^ having eigenvalues in the left 
half-plane, and a block ^l'^ G M^^^ having eigenvalues on the imaginary axis. On the fast 
timescale t/e, (2.61) can be rewritten as 



which admits {x~,z,y,0) as an equilibrium point. The linearization at this point has 
q + m + l eigenvalues on the imaginary axis (counting multiplicity) , which correspond to 
the directions z, y and e. In other words, z has become a slow variable near the bifurcation 
point. 

The centre manifold theorem implies the existence, for sufficiently small e and {z, y) 
in a neighbourhood M of (z, y), of a locally attracting invariant manifold x~ = x~{z, y, e), 
with x~{z,y,0) = x. x~ plays the same role the adiabatic manifold played in the stable 
case, and the dynamics on x~ is governed by the reduced equation 



ex = f{x,y,e), 
y = 9{x,y,e), 



(2.61) 




f~{x~,z,y,e), 
f{x~,z,y,£), 
£9{x~,z,y,e), 
0, 



(2.62) 



ez = f{x {z,y,e),z,y,e), 
y = 9{x~{z,y,e),z,y,£). 



(2.63) 



The function x {z, y, e) solves the PDE 



/ {x {z,y,e),z,y,s) = d^x {z,y,e)f {x {z,y,£), z,y,£) 

+ £dyx~ {z, y, e)g{x~ {z, y, e),z, y, e). 



(2.64) 
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dC = -f-i^r,zt,yt,t,e)dt + ^F-i^^,zt,yt,e)dWt, (2.66) 



Let us now turn to random perturbations of the slow-fast system (2.61). In the 
variables {x~,z,y), the perturbed system can be written as 

dXf = -f~{x^,zt,yt,e)dt + -^F~{xf,zt,yt,e)dWt, 
£ y £ 

dzt = -f{xt,zt, yt, e) dt + ^F\xJ,zt, yt, e) dWt, (2.65) 
dyt = g{xt ,zt,yt,£)dt + a'G{Xf , zt,yt, e) dWf. 

The noise-induced deviation of x^ from the adiabatic manifold is described by the variable 
^t^ = — x'~{zt, yt,£), which obeys an SDE of the form 

-ri^t ,zt,yt,t,e) dt + 

£ \J£ 

with, in particular, 

f~{Cz,y,t,£) = f~{x~{z,y,£) +C,z,y,£) - dzX~{z,y,£)f{x~{z,y,£)+C,z,y,£) 

- £dyX~{z,y,£)g{x~ {z,y,£) + C z,y,£) - a'^r~ , z,y,t,£), (2.67) 

where r~(^~, z,y,t,£) is at most of order m + q. Note that (2.64) implies that 

f- (0, z, y, t, e) = -a\- (0, z, y, t, e) = 0((m + q)a^). (2.68) 
We further define the matrix 

A~{z,y,£) = d:cf~ix~{z,y,£),z,y,£) - dzX~{z,y,£)dxf{x~{z,y,£),z,y,£) 

- £dyx~ {z,y,£)dxg{x~{z,y,£),z,y,£) (2.69) 

as an approximation to d^f~{0,z,y,t,£), where we neglect the contribution of r~. Since 
A~{z,y,0) = A~ , the eigenvalues of A~{z,y,£) have uniformly negative real parts, pro- 
vided we take the neighbourhood Af and £ small enough. 
Consider now the "linear approximation" 

dC? = -A- (zf * , ydet , £)^0 + (_^det ^ ^det ^ ^) ^ 

£ V ^ 

d^ft = l/0(x-(/^\/^\5),zr,yf*,e)dt, (2.70) 

dyf t = g{x-{z'"^\y'"'\£), S e) dt 

of (2.65)-(2.66), where FQ{z,y,£) = F~{0,z,y,£). Its solution has a Gaussian distri- 
bution with covariance matrix 

Cov(^O) = - fu-{t,s)F,-iz^^\y^^\£)F,-{z^^\y^^\£fU-{t,sfds, (2.71) 
£ Jo 

where U~ is the fundamental solution of £^'^ = Note that cr"^ Cov(^°) is the 

-variable of a particular solution of the slow-fast system 

£X- = A- {z, y, £)X- + X-A- {z, y, £f + Fq {z, y, £)F^ {z, y, £f, 

£z = f{x~{z,y,£),z,y,£), (2.72) 
y = g{x'{z,y,£),z,y,£), 
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which admits an invariant manifold X = X {z, y, e) for (z, y) G M. We thus expect the 
paths to be concentrated in a set 

= {{x~,z,y): {z,y) eAf,{x~ - x~{z,y,e),X~ {z,y,e)~^{x~ -x~{z,y,e))) < h^]. 

(2.73) 

The following theorem shows that this is indeed the CclSG ^ clS long as {zt,yt) remains in M. 

Theorem 2.8. Assume that \\X~ {z,y, e)\\ and ||X~(z, y, e)~"^)|| are uniformly bounded in 
N. Choose a deterministic initial condition {zo,yo) Xq = x~{zo,yo,£), and let 

TM = inf {s > : (z„ y,) N}. {2.1 A) 

Then there exist constants Hq > 0, Aq > and v G (0, 1] such that for all h ^ Hq, all 
A ^ Ao and allO <j < 1/2, 

^0,ix^,zo,yo){^^_^^^^tATM}^Cn,rn,g,,Ait,e)(^l+'^^ (2.75) 

provided e\log{h{l — 2'y))\ ^ 1. Here 

K = j[l~ 0{A) - 0{h%l - 27)i-niog(/i(l - 27))|)] , (2.76) 

Cn,m,,,^Ai-t, e) = const (l + ^) (l + ^) [(1 - 27)-^"-^) + e^"-')/^ + e^/^ + e«/^" . 

(2.77) 

The exponent is related to the maximal rate of divergence of solutions of the reduced 
system (2.63), sec Subsection 5.1. 

This result shows that on timescales of order 1 (and larger if, e.g., M is positively 
invariant), paths are likely to remain in a small neighbourhood of the adiabatic manifold 
x~ = x~{z,y,e). The dynamics will thus be essentially governed by the behaviour of the 
"slow" variables z and y. 

In fact, it seems plausible that the dynamics of (2.65) will be well approximated by 
the dynamics of the reduced stochastic system 

dz? = -j\x-izly't,e),zlyle)dt + ^F\x-{zlyle),zlyle)dW,, 

dy° = g{x- {zl yl s), z?,yl s) dt + a'G{x-{zl yl e),zI e) dWt, 

obtained by setting x~ equal to x~(z,y,e) in (2.65). This turns out to be true under 
certain hypotheses on the solutions of (2.78). Let us fix an initial condition (2:0,^0) ^ -^j 
and call C° = {zt,yt) the corresponding process. We define the (random) matrices 



ed^g edyg 



(2.79) 

x=x- {z^,y^,e),z=z^,y=y^ 



(2.80) 

x=x-{z°,y°,e),z=z°,y=y° 



Observe that C{{z, y),0) =0 because of our choice of coordinates, so that ||C(C°, e)|| will 
be small in a neighbourhood of the origin. We denote, for each realization ('^{uj), by V^, 
the principal solution of 

dCt(a;) = -B{C^{u),s)CtHdt. (2.81) 
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(Note that we may assume that almost ah reahzations C^{uj) are continuous.) We need 
to assume the existence of deterministic functions ■d{t,s), ??c'(ij'S)) and a stopping time 
r ^ '''B-{h) such that 

||Va,(t,s)|| ||V^(t,s)C(CV),£)H^c(i,s) (2.82) 

hold for all s ^ t ^ and (almost) all paths (Cu('^))«>o of (2.78). Then we define 

X^^it) = sup - /" ■d{s,uydu, 

X%\t) = sup - /'( sup ^c{s,v)') du (2.83) 
for i = 1, 2, and the following result holds. 

Theorem 2.9. Assume that there exist constants A,i?o > (of order 1) such that 

'd{s,u) ^ i3q and 'dc{s,u) ^ i?o whenever < s — u ^ Ae. Then there exist con- 
stants hQ,KQ > such that for all h ^ ho[x^^\t) V Xc\^)]~^ ^^'^ initial conditions 
{x',,zlyl)eB-{h), 



.0,K-,.0,,0)| ||(^„y,)-(^O^yO)|| 

^ 1 

'''\tht)+hx^l){t)+h^x^^){t) 



where 

Cm,q{t, e) = const (1 + ^) e("^+«)/^ . (2.85) 

This result shows that typical solutions of the reduced system (2.78) approximate 
solutions of the initial system (2.65) to order (Tx[?^(t)^/^ + a^Xc\^)^ as long as x''^\'t) 
1/a. Checking the validity of Condition (2.82) for a reasonable stopping time r is, of 
course, not straightforward, but it depends only on the dynamics of the reduced system, 
which is usually easier to analyze. 

Example 2.10. Assume the reduced equation has the form 

drf = 1. 

i.e., there is a pitchfork bifurcation at the origin. We fix an initial time to < and 
choose an initial condition (zo^yo) with yo = to, so that y° = t. In [7] we proved that if 

a ^ y/e, the paths {zs}s^to arc concentrated, up to time -^e, in a strip of width of order 
cr/dy^'l^/^ V e^/^) around the corresponding deterministic solution. 

Using for r the first-exit time from a set of this form, one finds that Xc (v^) is of 
order ^/s + a'^/£ and that Xc\V^) order 1 + Thus, up to time ^/e, the typical 

spreading of Zg around reduced solutions Zg is at most of order cre^/^ + a'^/^/e, which 
is smaller than the spreading of z^ around a deterministic solution. Hence the reduced 
system provides a good approximation to the full system up to time ^/£. 
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For larger times, however, x^c (^) g^ows like e*^/^ until the paths leave a neighbourhood 
of the unstable equilibrium 2; = 0, which typically occurs at a time of order y^ejlogo^. 
Thus the spreading is too fast for the reduced system to provide a good approximation to 
the dynamics. This shows that Theorem 2.9 is not quite sufficient to reduce the problem 
to a one-dimensional one, and a more detailed description has to be used for the region of 
instability. 

3 Proofs - Exit from B{h) 

In this section, we consider the SDE 

^t = -f{xuyt,e)^t+-^F{xt,yue)dWu , ^ 

£ (3.1) 

Ayt = g{xt,yt, e) dt + a'G{xt, yt,e) dWt 

under Assumption 2.2, that is, when starting near a uniformly asymptotically stable man- 
ifold. We denote by {xf'^^,yf'^^), with xf*^^ = a;(y^^*,e), the deterministic solution starting 
in ydet = yoeVo- 

The transformation 

xt = x{yf^' + rit,e) + Ct, 

yields a system of the form (2.10), which can be written, using Taylor expansions, as 

dCt = - [^(yf S £)6 + K^t, rit, t, £)] dt + ^ [Fo{yf''\s) + Fi(6, r?*, t, s)] dWt, (3.3) 

£ \j e 

d% = {C{yf\E)it + B{yf\E)T)t + c(6, ry*, t, e)\ dt + a' [Go(yf S e) + Gi (6, r?*, t, e)\ dWt. 
There are constants M, M\ such that the remainder terms satisfy the bounds 

||6(C,r?,t,e)K M(||ef + ||e||||r/|| +m£pV), 

||c(C,r?,t,£)KM(||Cf + ||r,f), 
||Fi(C,77,t,£)|| ^Mi(||C|| + ||7?||), 
||Gi(C,r/,t,£)HMi(||e|| + ||r/||) 

for all (^,77) in a compact set and all t such that y^^ G Pq- Note that M and Mi may 
depend on the dimensions n and m (see Remark 2.5). The term mep^a'^ stems from the 
term r{$,,r],t,e) in (2.11). We shall highlight its m-dependence since it will in general be 
unavoidable. 

3.1 Timescales of order 1 

We first examine the behaviour of ^„ on an interval [s, t] with A = (t — s)/e = Oe{l)- For 
this purpose, we fix an initial condition yo € Vq and assume that t is chosen in such a way 
that y^^* G Vq for all u^t. 
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To ease notations, we will not indicate the e-dependence of X{y). We assume that 
||X(y)|| ^ and ^ if_ for all y G Vq, and define the functions 

= - [)\U{t,ufX{y^^Y'Uit,u)\\ du, 
^ Jo 

m = - fTT[U{t,ufX{yf'')-'U{t,u)] du, (3.5) 
£ Jo 

©(*) = - / \\U{t,u)\\du, 
£ Jo 

where U{t,u) again denotes the principal solution of = A{yf^^,e)^. Note that the 
stability of the adiabatic manifold implies that ||t/(t,7i)|| is bounded by a constant times 
exp{— A'o(t — u)/e}, Kq > 0, for all t and u ^t. Hence \l'(t) and @{t) are of order 1, while 
$(t) is of order n. In particular, $(f) ^ n^{t) holds for all times t. 

We first concentrate on upper estimates on the probabilities and will deal with the 
lower bound in Corollary 3.5. Let us remark that on timescales of order 1, we may safely 
assume that the deviation ijs of yg from its deterministic counterpart remains small. We 
fix a deterministic hi > and define 

Trj = infjs > 0: ||r7s|| ^ /ii}. (3-6) 

Lemma 3.4 below provides an estimate on the tails of the distribution of r^. The follow- 
ing proposition estimates the probability that xt leaves a "layer" similar to B{h) during 
the time interval [s, t] despite of rju remaining small. Note that in the proposition the 
"thickness of the layer" is measured at y^^^ instead of yu- 

Proposition 3.1. For all a G [0, 1), all 7 G (0, 1/2) and all n > 0, 

sup pO,«o,0)r {^,,X{y^-Y%)^hA 



^ -^exp<^ -7^ 



{ [1 - «^ - ^o(A + (1 + ix)h + {h + /ii)e(i))] I 



(1 - 27)"/2 

+ ^m/^) e^p [ M^(l-MoA) ) 

holds for all h < 1/ fi, with a constant Mq depending only on the linearization A of f , K^, 
M , II-FqIIoo; cmd on the dimensions n and m via M . 

Proof: The solution of (3.3) can be written as 

in = U{u)io + ^ rU{u,v)Fo (yf * , s) dW, (3.8) 
Jo 

C7 1 

+ U{u,v)Fi{^y,riy,v,£)dWy + - U{u,v)b{^y,r]y,v,e) dv, 

Jo £ Jo 

where U(u) = U(u,0) as before. Writing ^„ = {7(ti,s)T„ and defining 

= inf{u ^ 0: {^u,X{yt''')-%) > /i'}, (3.9) 
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the probability on the left-hand side of (3.7) can be rewritten as 

p = pO,(€o,o)f ||Q(„)T„|| ^ ^1, (3.10) 

where Q{u) = Qs{u) is the symmetric matrix defined by 

Qiuf = U{u, sfX{yt''r'U{u, s). (3.11) 
To eliminate the u-dependence of Q in (3.10), we estimate P by 

P^pO,(eo,o)j ||Q(t)T„|| (3.12) 



where 



H = h( sup \\Q{u)Q{t)-'^\\] . (3.13) 

In order to estimate the supremum in (3.13), we use the fact that Q{v)~'^ satisfies the 
differential equation 

= ^U{s,v)Foiy'^'^\e)Foiyt'^\efU{s,vf, (3.14) 

and thus 

g(n)2Q(i)-2 = ii + Q(^)2l ['u{s,v)Fo{y'^'\e)Fo{yt''\efU{s,vfdv = 1 + 0{A). 

(3.15) 

(Recall that t — u^t — eA in this subsection, which implies ||?7(s,u)|| = 1 + 0{A) 
and \\Q{uf\\ ^K_{1 + 0(A)).) Therefore, H = h{l - C(A)). 

We now split T„ into three parts, writing = + + T^, where 

r'i = Uis)C, + ^ U{s,v)Foiy^'\e)dW,, 
Jo 

Tl = ^ rU{s,v)Fii^,,n,,v,e)dW,, (3.16) 
Jo 

Jl = - U{s,v)b{^y,riy,v,e)dv, 
£ Jo 

and estimate P by the sum of the corresponding probabilities 

p^ = pO,(eo,0)f g^p ||Q(t)TO|| ^iJo 

= pO,«o,o) j 3^p ||Q(i)Ti|| ^ hX (3.17) 
P2 = pO,«o,o)f g^p ||g(i)T2|| ^ijj, 

Ls^U^tATjATr, J 
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where Ho,Hi,H2 satisfy Hq + Hi + H2 = H. Note that P2 can be estimated trivially 
using the fact that 



sup ||Q(i)T^|| < ^/K^M{K+h^ + ^/K+hhi + mep^a^){l + 0{A))@{t) := H2. 

S^U^tAT^ATri 

(3.18) 

Now, we choose 

H2 = 2H2, 

Hi = fihH, (3.19) 
Hq = H — Hi — H2 

for < /X < l//i, and estimate the remaining probabilities Pq and Pi by Lemmas 3.2 
and 3.3 below. When estimating Hq, we may assume MohQ{t) < 1, the bound (3.7) being 
trivial otherwise. □ 

Lemma 3.2. Under the hypotheses of Proposition 3.1, we have for every 7 G (0, 1/2), 
holding uniformly for all ^0 such that {^o,X{yQ)~^^Q) ^ a^h^. 

Proof: For every 7 > 0, (exp{7||Q(t)T^p})„^s is a positive submartingale and, there- 
fore, Doob's submartingale inequality yields 

Po = po,(eo,o) r e7iiQWTSf ^ e^^oj ^ g-^^o ^m^fi) [^immn^y (3.21) 

Now, the random variable Q{t)1^ is Gaussian, with expectation E = Q{t)U{s)^o and 
covariance matrix 

S = ^Qit)(^jyis,v)Foiyt'^\e)Fo{y^'^\efUis,vfdv^Q{tf. (3.22) 
Thus, using completion of squares to compute the Gaussian integral, we find 

Jy(E,n-2^T,)-'^E) 

^ ^" (det[]l-27S])V2- ^^-^^^ 

By (2.20), we can write 

S = a'Q{t)U{s, t) [X(yf t) - U{t)X{y^<^')U{tf] U{s, tfQ{tf = a^[l- RR^] , (3.24) 

where R = Q{t)U{s)X{y^''^)^/^, and we have used the fact that U{s,t)X{yf''^)U{s,tf = 
Q{t)~'^. This shows in particular that 

det[]l - 27S] ^ (1 - 27cr2)". (3.25) 

Moreover, since ||PP^|| = ||P"^P|| G (0,1), we also have 

{E,{t-2^T.)-^E) = (X(yd<=t)-V2^o,i?'r(]l-27S)-ipX(yo"'^*)-V2^o) 

^ a^h^R^{t - 2^a'^[t - PP^])"^P|| 

^a^/i2([l-27(72]||P^P||-i + 27(72)"^ ^a^h^ (3.26) 

for all ^0 satisfying (Co, ^(yo)~''^^o) ^ a^h'^- Now, (3.20) follows from (3.23) by choosing 
7 = 7/(7^. □ 
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Lemma 3.3. Under the hypotheses of Proposition 3.1, 

(3.27) 

holds uniformly for all such that (^O) ^(yo)~^Co) ^ h^- 
Proof: Let r denote the stopping time 

r = AT^Ainf{« ^ 0: \\Q{t)Tl\\ ^ Hi}, (3.28) 
and define, for a given 71, the stochastic process 

(3.29) 



,7i||Q{t)Tif 



(Hij)„ being a positive submartingale, another apphcation of Doob's submartingale in- 
equahty yields 

Pi ^ e-^i-^? EO'(^°'0){Hm.}. (3.30) 

Ito's formula (together with the fact that (dWu)^ RdWu = Tt{R^R) du for any matrix 
R g R^x'^) shows that obeys the SDE 



r2 



dS„ = 27i^S„(Ti)^Q(t)2c/(s,u)Fi(e„,r?„,u,e) dW„+7i^S„ TV[i?f i?i + 27ii?i^P2] du, 

(3.31) 

(3.32) 



where 

i?i = Q{t)U{s,u)Fi{^u,Vu,u,e), 
R2 = iTlfQ{tfUis,u)Fi{^u,riu,u,e). 

The first term in the trace can be estimated as 

,2 



Tr[RjRi] = TV[i?ii?f] ^ M^{Uu\\ + hu\\) TT[Q{tfU{s,u)U{s,ufQ{t)] 

^M^{UJ + \\7^J)'T,[Uit,ufX{yf'Y'U{t,u)], (3.33) 

while the second term satisfies the bound 

Tt[R'^R2] = \\Fi{^u,Vu,u,efU{s,ufQ{tfTif 

< mUUuW + \\vu\\f\\U{s,ufQit)\f\\Qit)Tif 

= M2(||e„|| + ||r?„||)'||i7(t,u)^X(yf*)-i[/(i,«)||||Q(t)Ti|p. (3.34) 

Using the fact that ^ \/K^h, ||77„|| ^ hi and ||Q(t)Ti|| ^ Hi hold for all ^ u ^ t/\T, 
we obtain 

£ Jo 
X "!¥[[/(*, t;)^X(j/f *)-![/(*, ^;)] + 271/^2 ^)T^(ydet)-i^(^^ ^335^ 
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and Gronwall's inequality yields 

EO,«o,o)|5^^^| ^ expjTia^Mf (v^/i + /ii) + 271 (3.36) 
Now, (3.30) implies 

Pi ^ exp{-7i(/fi2_^2^2(^/^^^^^)2^(^)^^272a2Mf (V^/t+/ii)^i?i*(t)}, (3.37) 

and (3.27) follows by optimizing over 71. □ 

Proposition 3.1 allows to control the first-exit time of {xt,yt) from B{h). provided 
Vs = ys~ yf"^^ remains small. In order to complete the proof of Part (a) of Theorem 2.4 we 
need to control the tails of the distribution of r^. The following lemma provides a rough 
a priori estimate which is sufficient for the time being. We will provide more precise 
estimates in the next section. 

Recall the notations V{u,v) for the principal solution of ?) = B{y'^^ ,e)r], and 



X^''\t)= sup fi sup \\V{s,v)\\)du, 
sup / ( sup ||F(s,i;)||^ j du. 



(3.38) 
(3.39) 



from Subsection 2.3. 



Lemma 3.4. There exists a constant > such that for all choices oft>0 and hi > 
satisfying y^^^ G Vq for all s ^t and hi ^ c,]X^^\'t)~^ , 



_sup pO-(«0,0)| 



< 2 



t 



e"^/^ exp 



sup ||?7u|| ^ hi 

2 



mep" 1 

(p2+£)^(2)(t)/ 



X exp< —kq 



hl{l-0{Ae)) 

V2(p2+£)^(2)(^)L 



1-M[,x^'\t)hi(l + K+^) 



(3.40) 



where kq > is a constant depending only on ||F||oo; ||G||oo; ||C'||oo ctnd U, while the con- 
stant Mq depends only on M, ||C||oo o,nd U. Note that may depend on the dimensions n 
and m via M . 

In the sequel, we will typically choose hi 3> cr, so that the prefactor becomes negligible. 

Proof of Lemma 3.4. We first consider a time interval [s,t] with t — s = Ae. Let 
u G [s, t\ and recall the defining SDE (3.3) for rju- Its solution can be split into four parts, 
riu = rii + ril + ril + rjl, where 



a' / V{u,v)G{^^,ri^,v,£)dWy, 
Jo 

/ S{u,v)F{Cy,r]y,v,e)dW^, 
Jo 

vl= V{u,v)c{^v,riv,v,£)dv, 
Jo 

S{u,v)b{^v,r]-a,v,e) dv, 



(3.41) 



4 



25 



with 

S{u,v)= V{u,w)C{yi^\e)U{w,v)dw. (3.42) 

Jv 

Let T = Tg(/i) A r^. It follows immediately from the definitions of re(/i), and the 
bounds (3.4) that 

\\7^l^4^M{l + 0{Ae))x'^'\t){K+h' + hl), 
\\r)l^^\\ ^ M'x^^\t){K+h^ + ./K^hhi + mep'^a^) 

for all u G [s,t]. Here M' depends only on M, U and ||C||oo- Furthermore, using similar 
ideas as in the proof of Lemma 3.3, it is straightforward to establish for all Ho,Hi > 
that 



'I sup WrilW ^ Hi] ^ e'^/^ ex.p\ 



l)0,({o,0). 



l|G||2,x(2)(t), 

if,^(l-0(Ag)) 
8a%cs||F||2,x(2)(t) 



(3.44) 



where cs is a constant depending only on S. Then the local analogue of estimate (3.40) 
(without the t-dependent prefactor) is obtained by taking, for instance, Hq = Hi = 
^hi-2{M + M')x'^'^\t){K+h? + hl + mep'^a^), and using hi ^ c^x^^H*)~^ where we may 
choose Cr, ^ 1/{2M(^). 

It remains to extend (3.40) to a general time interval [0,t] for t of order 1. For this 
purpose, we choose a partition = uq < ui < ■ ■ ■ < uk = t of [0,t], satisfying = kAe 
iov ^ k < K = \t/{A£)~\. Applying the local version of (3.40) to each interval [^^,1x^+1] 
and using the monotonicity of x^^^ (u) , the claimed estimate follows from 

K-l 

pO,(€o,o)r gyp ||r/„|| ^ /iij ^ ^ pO'«o>o)| sup \\r]u\\^hi\. (3.45) 

□ 

We will now show that Proposition 3.1 and Lemma 3.4 together are sufficient to prove 
Parts (a) and (b) of Theorem 2.4 on a timescale of order 1. We continue to assume that 
yo € ^0 but we will no longer assume that y^^* G Pq automatically holds for all u ^ t. 
Instead, we will employ Lemma 3.4 to compare yu G Pq and y^*^*, taking advantage of the 
fact that on timescales of order 1, r]t is likely to remain small. Note that if the uniform- 
hyperbolicity Assumption 2.2 holds for Vq, then there exists a 6 > oi order 1 such that 
the (5-neighbourhood ^^{5) also satisfies this assumption. We introduce the first-exit time 
Tp^* of the deterministic process y^^* from {S) as 

r^f =inf^^O:yr0Po^(5)} (3.46) 
and remark in passing that Te(/j) A ^ r^^* holds whenever hi ^ 5. 

Corollary 3.5. Fix a time t > and h > in such a way that h ^ cix^^"^ {t At^^)~^ for a 

sufficiently small constant ci > and X*'^H^ ^ '^Vq) ^ (p^ +£)~^ . Then for any a G [0, 1), 
any 7 G (0, 1/2) and any sufficiently small A, 

C-^{t, s) e-- (o)'^Vc^^ ^ pO,«o,o) 1^^^^^ < i| ^ d^^^^t, e) e^+W'^Vc^^ (3.47) 
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holds uniformly for all satisfying (Co)-^(yo) ^^o) ^ a^h"^- Here 

K+{a) = 7[1 - a2 _ o(A) - 0{mep^) - + x^'\t A T%f))h)] , 

(l-27)"/2 ^ ^ 



TT cr 

e"/4 +4 



(3.48) 
(3.49) 
(3.50) 

(3.51) 



Ae 



-,m/4 



e 2o- 



Proof: We first establish the upper bound. Fix an initial condition (^O)O) satisfying 
{^o,X{yQ)~^^Q) ^ a^/i^, and observe that 

<t}^ < t ^ ^v} + P°'«-°){r, < t A rg(,)} (3.52) 

= P°'(«°'°){rB(/.) < t A r^gf A r,} + pO'(«'"«){r, < t A 4f A r6(,)}. 

To estimate the first term on the right-hand side, we again introduce a partition = ito < 
ui < • • • < UK = t of the time interval [0, t] , defined by Uk = kAe iov ^ k < K = 
\t/{Ae)]. Thus we obtain 

K 

< t A 4f A Tr,} ^ ^P0'(«0''^){n,_i ^ rs(,) < A 4f A r,}. (3.53) 
k=l 

Before we estimate the summands on the right-hand side of (3.53), note that by the 
boundedness assumption on ||X(y)|| and ||X^-'^(?/)||, we have X (?/„)" -"^ = X{y'^'^*^)^^ +0{hi) 
for u ^ T^^* A r^. Thus the bound obtained in Proposition 3.1 can also be applied to 
estimate first-exit times from B{h) itself: 



i>0,Ko,0) 



^ pO,(€o,o) f {^u,X{yt'T'Q > h\l - 0{h,))], (3.54) 

Uk-l4^U<Uk/\T^^/\Tr, 

while the second term on the right-hand side of (3.52) can be estimated directly by 
Lemma 3.4. Choosing 

= %Ml [v/:^ + h^/{h{l - 0{hi)))\ ^^{t A T^^^)/ [1 - 0(/ii) - MoA] (3.55) 

and h\ = h/-^fKQ in the resulting expression, we see that the Gaussian part of gives the 
major contribution to the probability. Thus we obtain that the probability in (3.52) is 
bounded by 



Ae 



K 



+ 2e"^/Sxp 



h?{l - 0{x^^\t A r^f )/i) - 0(Ae) - 0{mep'^)) 



a2(p2+£);^(2)(4;V^det^ 



(3.56) 
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where we have used the fact that ^ n'^{t), while '^{t) and @{t) are at most of order 1. 
The prcfactor e"^^P can be absorbed into the error term Oimep^) in the exponent. This 
completes the proof of the upper bound in (3.47). 

The lower bound is a consequence of the fact that the Gaussian part of gives the 
major contribution to the probability in (3.47). To check this, we split the probability as 
follows: 

= pO,{«o,o)|~^ < t Ar,} -P0'«0'0){f^ <T^< t} +P°'(«°'°){^BW < t,T^ < t} 

^ pO,(«o,o)|^^ < t Ar^} -pO.«o,o)|^^ <tArs(h)}, (3.57) 

where 

= inf{t/ ^ 0: (^„,X(yf*)-iC„> ^ /^^(l + 0(/ii))}, (3.58) 

and the C'(/ii)-term stems from estimating X{yu)~^ by X(7/^'^*)~"^ as in (3.54). The first 
term on the last line of (3.57) can be estimated as in the proof of Proposition 3.1: A lower 
bound is obtained trivially by considering the endpoint instead of the whole path, and 
instead of applying Lemma 3.2, the Gaussian contribution can be estimated below by a 
straightforward calculation. The non-Gaussian parts are estimated above as before and are 
of smaller order. Finally, we need an upper bound for the probability that <t /\ Tj^{h)i 
which can be obtained from Lemma 3.4. □ 

3.2 Longer timescales 

Corollary 3.5 describes the dynamics on a timescale of order 1, or even on a slightly 
longer timescale if x^^\t)^ X^'^\t) do not grow too fast. It may happen, however, that 
ydct ]-gj]2ains in T>q for all positive times (e. g. when T>q is positively invariant under the 
reduced deterministic flow). In such a case, one would expect the vast majority of paths 
to remain concentrated in B{h) for a rather long period of time. 

The approach used in Subsection 3.1 fails to control the dynamics on timescales on 
which x^'''\i) ^ 1) because it uses in an essential way the fact that rjt = yt — yf^^ remains 
small. Our strategy in order to describe the paths on longer timescales is to compare 
them to different deterministic solutions on time intervals [0,T], [T,2T], where T 
is a possibly large constant such that Corollary 3.5 holds on time intervals of length T, 
provided yt remains in Dq . Essential ingredients for this approach are the Markov property 
and the following technical lemma, which is based on integration by parts. 

Lemma 3.6. Fix constants si ^ S2 in [0,oo], and assume we are given two continuously 
differentiable functions 

• (/p : [0, do) — > [0, oo), which is monotonously increasing and satisfies ip{s2) = 1, 

• : [0, oo) M which satisfies (Pq[s) ^ for all s ^ si. 

Let X ^ be a random variable such that F{X < s} ^ Vq{s) for all s ^ 0. Then we have, 
for all t ^ 0, 

E{l[o,i)(X)^(X)} ^ <t}- v\s)Ms)ds, (3.59) 

J SI At 

where (p{s) = ip{s) A 1. 
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We omit the proof of this result, which is rather standard. See, for instance, [7, 
Lemma A.l] for a very similar result. 

When applying the preceding lemma, we will also need an estimate on the probability 
that (^T, -^(j/t)~^^t) exceeds h^. Corollary 3.5 provides, of course, such an estimate, but 
since it applies to the whole path, it does not give optimal bounds for the endpoint. An 
improved bound is given by the following lemma. Recall the definition of the first-exit 
time TVo of yt from Vq from (2.26). 

Lemma 3.7. // T and h satisfy h ^ cix'^^^T A r^f )~^ and x^^HT A T%f) ^ (p^ + e)'^, 
we have, for every 7 G (0, 1/2), 

_sup P°'(«°'°)|(eT,X(j/T)-'M > h\Tv, ^ tW Cn,m,,{T,e)e--'^''''\ 

(3.60) 

where 

k' = j[l- 0(A) - 0{h) - 0(e-2-^°^/^ /(I - 27))] , (3.61) 
Cn,m,,{T, e) = j^^^n^ + 4C7+^,,(r, e) e'^^^^'^^ ■ (3.62) 
Proof: We decompose as = ^° + ^1 + where 

e° = U{t)io + ^ f U{t,u)FQ{yt-\e)dWu, 

^1 = ^ U{t,u)Fi{^u,r}u,u,e)dWu, (3.63) 
Jo 

1 /"* 

it=-l U {t, u)b{^u, Vu, u, e) du, 

£ Jo 

and introduce the notations and for the stopping times which are defined like 
and in (3.9) and (3.6), but with h and hi replaced by 2h and 2hi, respectively. The 
probability in (3.60) is bounded by 

pO,(«o,o)|^^^^^(ydet)-i^^^ ^ h\l - 0{hi)),fr, > r} +P0.«o.o)|^^ ^ t]. (3.64) 

Let = h^{l — 0{hi)). As in the proof of Proposition 3.1, the first term can be further 
decomposed as 

^ pO-(«o,0)|||^(ydet)-l/2^0 II ^ i^^^l +pO,(€o,0)|^^ > T,f^ ^ r} 
+ pO,(«o,0)|||^(ydet)-l/2^1^|| ^ > j.^ > 

+ pO,{«o,0)|||^(ydet)-l/2^2|| ^ > r,f^ > r}, (3.65) 

where we choose Hi, H2 twice as large as in the proof of Proposition 3.1, while Hq = 
H — Hi — H2. 

The first term on the right-hand side can be estimated as in Lemma 3.2, with the 
difference that, the expectation of being exponentially small in T/e, it leads only to a 
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correction of order Q-'^^oT/e /(i _ 27) in the exponent. The second and the third term can 
be estimated by Corollary 3.5 and Lemma 3.3, the only difference lying in a larger absolute 
value of the exponent, because we enlarged h and hi. The last term vanishes by our choice 
of H2. Finally, the second term in (3.64) can be estimated by splitting according to the 
value of Tg(2fe) and applying Lemma 3.4 and Corollary 3.5. □ 

We are now ready to establish an improved estimate on the distribution of rg^/j-j. As 
we will restart the process yf^ whenever f is a multiple of T, we need the assumptions 
made in the previous section to hold uniformly in the initial condition yo € Pq- Therefore 
we will introduce replacements for some of the notations introduced before. Note that 
X^'^Ht) = Xyo{i) and x'^'^\'t) = xfa i'^) depend on yo via the principal solution V. Also 
'^•Do* ~ '^ifo^yo) naturally depends on yo- We define 

xW(t)= sup 4)(tA4f(yo)), (3.66) 

3/oeX'o 

X^'\t)= sup xi?(*A<*(yo)). (3.67) 
2/0 e^o 

In the same spirit, the x*^*)(r)-dependent C>(-)-terms in the definitions of «+(«), k' and 
the prefactors like C^^^(r, e) are modified. 

Wc fix a time T of order 1 satisfying x'-^-*(T) ^ (p^ + e)~^. T is chosen in such a way 
that whenever h ^ cix^^\T)~^, Corollary 3.5 (and Lemma 3.7) apply. Note that larger 
T would be possible unless p is of order 1, but for larger T the constraint on h becomes 
more restrictive which is not desirable. Having chosen T, we define the probabilities 

Pk{h) = < kTATVo}, (3.68) 

Qk{h) = P°'(°'°){(CfcT,^(2/fcT)-'CfcT) ^ h^Tvo ^ kT}. (3.69) 

Corollary 3.5 provides a bound for Pi{h), and Lemma 3.7 provides a bound for Qi{h). 
Subsequent bounds are computed by induction, and the following proposition describes 
one induction step. 

Proposition 3.8. Let k ^ k+(0) A k' . Assume that for some A; G N, 



Pfe(/i)^L>fee-«'^>', (3.70) 

Qkih)^Dke-''^"/-\ (3.71) 

Then the same bounds hold for k replaced by k + 1, provided 

Z?fc+i ^ A + C+^,^(r,£)5fe-^e(^-'^)'^Vc^^ (3.72) 

Dk+i^ Dk + Cn,n.,^{T,e). (3.73) 



Remark 3.9. Below we will optimize with respect to k, but note that in the case k~^(0) = 
k' = 7, we may either choose k < k+(0) A k', or we may replace (3.72) by 



(3.74) 
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Proof of Proposition 3.8. We start by establishing (3.73). The Markov property al- 
lows for the decomposition 

Qk+i{h) 

+ eO'(O'O) { ^feT}P'^'(^'="'°) { {Cik+i)T, X{y(k+i)Tr'C(k+i)T) > /l^ rx,„ ^ (A; + 1)T} } 
^ Qkih) + Cr,,m,j{T,e)e-^^'/-\ (3.75) 

where the initial condition {^kT, 0) indicates that at time kT, we also restart the process 
of the deterministic slow variables yf^^ in the point ykx € 2^o- In the second line, we used 
Lemma 3.7. This shows (3.73). 

As for (3.72), we again start from a decomposition, similar to (3.75): 

+ EO'(0'0){l{,^(^^^fer}P'=^'«'=-o){rB(,) < {k + l)TATvo}}. (3.76) 
Corollary 3.5 allows us to estimate 
Pk+i{h) 

^ Pik(M +E°'(°'°){l{(^,,,x(,,,)-i^,,)^ft2}[¥'((efeT,^(yfeT)-'efer)) A 1] I Tvo ^ kT] 

X P°'(°'°){too ^ fcT}, (3.77) 

with 

'Pis) = C+^,^{T, e) e(^-«)'^V-^ ^-^(h--s)/a- . (3.73) 

(3.71) shows that 

F°'^°'°H(^feT,X(yfcT)-'6T) <s\tvo> kT} ^ <fk{s), (3.79) 

where 

^k{s) :=(1 - 5fee--/-')/P°'(°'°){Ti,„ ^ kT}. (3.80) 
The functions ip and ipk fulfil the assumptions of Lemma 3.6 with 

e^^^/'^' = C+„,^(r,e)-ie^'^'/^' and e^^^/^' = 5^. (3.81) 

For /i^ ^ si, (3.70) becomes trivial, while for > si, Lemma 3.6 shows 

Pfe+i(^) ^ Pfe(/i) - (^(^2 S2) [1 - P°'^°'°H(CfeT, ^(j/jfcT)"'ar) < /i', r^,„ ^ fcT}] 

+ V{si)+ 9''(s)Z)fce-'^^/'^ ds 

J Sl 

^ Pk{h) + C+^,^(r, £)5fc-^ e(^-«)'^Vc^^ e-'^'^V-^ . (3.82) 



Now, (3.72) is immediate. □ 
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Repeated application of the previous result finally leads to the following estimate. 
Corollary 3.10. Assume that yo G Vq, xq = x{yo,e). Then, for every t > 0, we have 



7 



2 Tj 2(7 -k) 
In addition, the distribution of the endpoint satisfies 



-(2k-7)/i2/ct2 



(3.83) 



(3.84) 



Proof: We already know the bounds (3.70) and (3.71) to hold for k = I, with Di = 
C+^ ,|,(T, e) and Di = Cn,m,'y{T,s). Now the inductive relations (3.72) and (3.73) are seen 
to be satisfied by 



1 + Cn,m,^{T, e)^e^^-')'^'l'^' 

7 — «; ^-^ 



(3.85) 



The conclusion follows by taking k= {t/T'] and bounding the sum by \{t/T){t/T + 1) ^ 
\{t/T + l/2)\ □ 

To complete the proof of Part (a) of Theorem 2.4, we first optimize our choice of k, 
taking into account the constraint k ^ k^{Q) A k'. By doing so, we find that 



2(7 - k) (7^ 
where we have set 

K = 7[1 - 0{h) - 0(A) - 0{mep^) - 0(6-=""^*/^ /(I - 27))] . 
Simplifying the prefactor in (3.83) finally yields the upper bound 

[l+tf\ 1 



(3.86) 



(3.87) 



^ const 



Ae [(1-27)' 



+ e"/^ + e™/^ 



(3.88) 



Note that the lower bound in Part (b) of Theorem 2.4 is a direct consequence of the 
lower bound in Corollary 3.5, so that only Part (c) remains to be proved. 

3.3 Approaching the adiabatic manifold 

The following result gives a rather rough description of the behaviour of paths starting 
at a (sufficiently small) distance of order 1 from the adiabatic manifold. It is, however, 
sufficient to show that with large probability, these paths will reach the set B{h), for some 
^ > cr, in a time of order £|log/i|. 
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Proposition 3.11. Let t satisfy the hypotheses of Corollary 3.5. Then there exist con- 
stants ho, Sq, Co and Kq such that, for h ^ ho, S ^ Sq, j E (0, 1/2) and A > sufficiently 
small. 



sup J 

«o : («o,X(2/o)-ieo>^<52 



50,(«o,0) 1 



Io^Sat^o {h + co6e-^os/ey 



^ 1 



(3.89) 



Ae 



(1 - 27)^/2 



+ (e-/4+2e"^/^)e 



rn/i\ —Kh^/a^ 



where ac = 7[1 - 0{h) - ©(A) - 0{mep'^) - 0{5)\. 

Proof: We start again by considering an interval [s, t] with t — s = Ae. Let y^^^ = 2/o £ 
Po- Then 



i>0,(?o,0) 



sup , „ ; ^n 

^pO,tto,o)f (e„,X(yf*)-ie«>^i?' 

where r is a stopping time defined by 

r = r^,o A A inf{«^0: X(y^et)-iQ ^ (/i + co(5e-^°"/^)2(l - 0(A))}, (3.91) 

and H"^ is a shorthand for H'^ = = {h + co(5e-^o*/^)2(l - 0(A)). 

The probabihty on the right-hand side of (3.90) can be bounded, as in Proposition 3.1, 
by the sum Pq + A + -P2, defined in (3.17), provided Ho + Hi + H2 = H. Since \\U{s)\\ 
decreases like Q-^os/e _ ^KqA ^-Kot/e ^ have 



(ea,x(yj-ie«> 



^ 1 



(3.90) 



Pn ^ 



(1 - 27)»/2 



cxp 



-7 



H^- const <j2e2XoAg-2i^otA 



(3.92) 



Following the proof of Lemma 3.3, and taking into account the new definition of r, we 
further obtain that 



Pi ^ e"/^exp|- 



1 



(72 Mf const [{h + hif^it) + cl6\t/e) e-2^oV^] 
As for P2, it can be estimated trivially, provided 

M 

H2 ^ const —— 



}• 



{h^ + hhi + mep^a^)eit) + cl6^ e^«^ e"^"*/^ . 
Choosing iJi in such a way that the exponent in (3.93) equals H^/a^, we obtain 

^^l(l-27)"/2+' j 

X exp{-7:^ [1 - 0{A) - 0{mep^) - 0{h + hi + co(5)] }, 



(3.93) 



(3.94) 



(3.95) 



where we choose hi proportional to h + CQSe^°^e ^0*/^. The remainder of the proof is 
similar to the proofs of Lemma 3.4 and Corollary 3.5. □ 
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The preceding lemma shows that after a time ti of order e|log/i|, the paths are likely 
to have reached B(h). As in Lemma 3.7, an improved bound for the distribution of the 
endpoint can be obtained. Repeating the arguments leading to Part (a) of Theorem 2.4, 
namely using Lemma 3.6 on integration by parts and mimicking the proof of Corollary 3.10, 
one can show that after any time t2 ^ ii, the probability of leaving B{h) behaves as if the 
process had started on the adiabatic manifold, i.e., 

F°'«0'0)| sup {^s,X{ys)-%) ^ hA ^ Cl^,,,^{t,e)(l + ^) e^^^^V^^^ (3.96) 

uniformly for all such that {(,o, X{yo)~^^o) ^ 5^. Here 7 a(*' ^) same prefactor 
as in Theorem 2.4, cf. (2.29), and 

K+ = j[l- 0{h) - 0(A) - 0{mep^) - 0(5 e-'^''"^*^*^^^)/^ /(I - 27))]. (3.97) 

This completes our discussion of general initial conditions and, in particular, the proof of 
Theorem 2.4. 



4 Proofs — Dynamics of Q 

In this section, we consider again the SDE 

^t = -f{xuyt,£)<^t + -^F{xt,yu£)<^Wt, 

£ V£ (4.1) 

dyt = g{xt, yt, s) dt + a'G{xt, yt, s) dWt 

under Assumption 2.2, that is, when starting near a uniformly asymptotically stable man- 
ifold. We denote by (x^^*, yf^*), with xf^^ = x{yf^^,£), the deterministic solution starting 
in ^Q^* = yo € ^0- The system can be rewritten in the form (3.3), or, in compact notation, 
as 

dCt = [A{yf'\eKt+B{Cut,e)] dt + f7[.Fo(yf\e) +.Fi(Ci,t,e)] dWt, (4.2) 

where ('^ = (^^,??^), A and J^o have been defined in (2.39), and the components of 
B'^ = (£-16^, c^) and = {e-^/'^ F[ , pGj) satisfy the bounds (3.4). 

The solution of (4.2) with initial condition = {$,q , 0) can be written in the form 

Ct=U{t)Co + (r fu{t,s)rQ{yf\e)dWs 
Jo 

+ [ U{t,s)B{(:s,s,£)ds + a [ Uit,s)J^i{(:s,s,e)dWs. (4.3) 
Jo Jo 

The components of the principal solution U{t, s) satisfy the bounds 

\\U{t,s)\\ ^ const e-^°(*-*)/^ 

\\S{t,s)\\ ^ const||C||oo^(l-e-^o(*-^)/^) sup \\V{t,u)\\. (4.4) 

We want to estimate the first-exit time 

Tc = inf{« ^ 0: {Cu,^{u)-\u) ^ h^}, (4.5) 
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with Z{u) defined in (2.49). The inverse of Z{u) is given by 

/ (X - ZY-^Z'^)-^ -X-^ZiY - Z'^X-^Z)-^\ 

Z-^=[ . (4.6) 

\-Y-^Z^{X-ZY-^Z^)-^ (Y-Z^X-^Z)-^ J 

Since we assume ||X||oo and ||X~-'-||oo to be bounded, ||^||oo = 0{-\/ep+£) and ||5^~''"||[o,t] = 
0(l/(/92 + £)), we have 

_ ^ foil) 0{l) \ 
As in Section 3, we start by examining the dynamics of on an interval [s,t] with 

A = {t-S)/S = 0e{l). 

The following functions will play a similar role as the functions $ and introduced 
in (3.5), played in Section 3: 

$(0= / Tr[j{vfU{t,vfZ{t)-^U{t,v)J{v)]dv, 
Jo 

^{t)= I \\j{vfU{t,vfZ{t)-^U{t,v)J{v)\\dv, (4.8) 
Jo 

where 

Jiv) = — -^mv,v,e) = ^^f^ (4.9) 

for V ^ r^. Using the representations (2.41) of U and (4.6) of Z~^ and expanding the 
matrix product, one obtains the relations 

$(t) ^ «>(*) + / / Tr[V {t,vfY{t)-^V{t,v)] dv + 0{in + m){l + x^^Ht) + x^^\t))), 
Jo 

§(t) ^ *(t) + p2 f\\V{t, vfY{t)-^Vit, v)\\ dv + 0{1 + x«(t) + X^^\t)) , (4.10) 
Jo 

valid for all t ^ r^. Now we are ready to establish the following analogue of Proposition 3.1. 

Proposition 4.1. Fix an initial condition {xo,yo) with yo £ Vq and xo = x{yo,e), and 
let t be such that y^*^* G Pq for all u ^ t. Then, for all a G [0, 1], all 7 G (0, 1/2) and all 
II >0, 

sup pO'^oj sup {Cu,Z{u)-'Q^h''\ 

Co={«0,0) : {&,,X(?/o)-l«o><a2?i2 ]^s^u^tAT-Do ) 



(1 — 27)("+'")/2 

X exp|-7^ ^i-a'-0{A + e + ixh + h\\Z\\[o,t] (l + ll>^~'ll[o(?]X^'^ (*)))] } 
+ e5«/-« exp|4 /-^(l-^(^)) j (4.11) 

I C72l6M2||Z||[o,t]*(t)J 



/loWs whenever Vma^ ^ /i < l//x. 
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Proof: Writing = U{u, s)Tu, we have 



i>0,Co 



j sup {Cu,2iu)-\u)>hA=F^'<4 sup \\Qiu)Tu\\ ^ h] (4.12) 



where Q{u) is the symmetric matrix defined by 

Q{uf = U{u, sfZ{u)-^U{u, s). (4.13) 

As in the proof of Proposition 3.1, we want to ehminate the w-dependence of Q in (4.12). 
It turns out that the relation || Q(M)Q(i)~^ || = 1 + C(A) still holds in the present situ- 
ation, although the proof is less straightforward than before. We establish this result in 
Lemma 4.2 below. 

Splitting Tu into the sum = TO + + T^, where the are defined in a way 
analogous to (3.16), we can estimate the probability in (4.12) by the sum Pq + Pi + P2, 
where 



Po = P°'^» sup WQmlW^Ho}, 

Pi=F°'^"j sup \\Q{t)Tl\\^ hX (4.14) 
P2 = pO.Co/ sup \\Qit)rl\\^H2], 

and Ho + Hi + H2 = h{l — 0{A)). Following the proof of Lemma 3.2, it is straightforward 
to show that 



Pn ^ 



(1 _ 2^|(n+H/2 -p{-^(^o - - 0{s))}, (4.15) 



the sole difference being the factor 0{e) in the exponent which stems from the fact that 
(Co, '2^(0)~^Co) = {S,o, X{0)^^(^o){l + 0{£)). Furthermore, similar arguments as in the proof 
of Lemma 3.3 lead to the bound 

Pi^exp<^--^ — }. 4.16 

I 16a^Mfh^Hl\\Z\\[o,t]^{t) J 

Finally, the estimate 

\\Q{t)^lr.r,f ^ \\BiC^,v,efUit,vfZit)-^U{t,w)B{Cn„w,e)\\dvdw 

^ const [h''\\Z\\f,^,^{l + \\Y-%,^,]X^^\uf) + {mEp^a^)\l + x^^\u))], 

(4.17) 

which holds whenever h ^ V ma'^, shows that P2 = for 

H2 > o(/i2||^||[o,,](l + ||y-i||f/2x('Ht))+mepVyi + xW(i)). (4.18) 
Hence (4.11) follows by taking Hi = iih?{l - 0(A)). □ 
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In the proof of Proposition 4.1, we have used the following estimate. 
Lemma 4.2. For A = (t — s)/e sufficiently small, 

sup \\Q{u)Q{t)-^\\ = l + 0{A). (4.19) 

Proof: Using the fact that Q(f satisfies the ODE 
d 



^^Q{v)-'=u{s,v)MyT\^)MyT\^yi^is,vy > (4-20) 



we obtain the relation 



Q(u)2Q(i)-2 = 1+Q(uf fu{s,v)J^o{yt\e)J^o{yf\efU{s,vfdv. (4.21) 

Ju 

The definition of and the bound (4.4) on \\S\\ allow us to write 



^0(A + p/Vi) 0{Ah + p 



) 



Using the estimate (4.7) for Z ^ and the fact that we integrate over an interval of length 
Ae, it follows that 

Q{ufQ{tr'-l = Ai ^ ^ ^ (4.23) 
which implies (4.19). □ 



Now, Theorem 2.6 follows from Proposition 4.1, by taking a regular partition of [0,t] 
spacing Ae and p = 4Mi||2^||p^j^'(i)^/^. We use in particular the fact that ^(t) = 
I" X^''^H*) + X^'^\t))i ^'iid that the right-hand side of (2.51) exceeds 1 for ^ < \/rno^. 



11I/2 

0(1 + 

5 Proofs — Bifurcations 



We consider in this section the behaviour of the SDE (2.1) near a bifurcation point. The 
system can be written in the form 

dCt" = -f'i^r^zt, yt, t, e) dt + -^F-{^^,zt, yt, e) dWt, 

dzt = -f'{^t,zt, yt,e) dt + ^F\it,zt, yt,e) dWt, ^^'^^ 
<^yt = giCt ' zt, yt, £) dt + a'G{^'^ , zt, yt, e) dWt, 

compare (2.65) and (2.66). We consider the dynamics as long as {zt, yt) evolves in a neigh- 
bourhood of the bifurcation point, which is sufficiently small for the adiabatic manifold 
to be uniformly asymptotically stable, that is, all the eigenvalues of dxf~{0,z,y,e) have 
negative real parts, uniformly bounded away from zero. 
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5.1 Exit from B~{h) 

Let hrj, hz ^ 0. In addition to the stopping time 

Tr, = inf{s > 0: \\r]s\\ ^ /i,,}, 
cf. (3.6), we introduce the corresponding stopping time for Zg — Zg'^^, namely, 



Tz = inf |s > 0: 



~det| 



^hz}. 



(5.2) 



(5.3) 



The following result is obtained using almost the same line of thought as in Section 3.1. 

Proposition 5.1. Let t be of order 1 at most. Then, for all initial conditions such 
that , {yo, zo)^^^q) ^ a'^h'^ with an a e (0,1], all j G (0,1/2), and all sufficiently 
small A > 0, 

pO,(«o-,^o,yo)| sup {^;,X-iys,z,y'C)^h^} 



t 



+ 



t 



gO((m+g)3/V) 

l_27)(n-?)/2 ^^P|"^^ 



1 - a"^ - 0{A + {1 + n)h + hrj + hz) 



e(n-?)/4 



a^O{(l + {hr, + hz)/hY)\- 



(5.4) 



Proof: The proof is similar to the proof of Corollary 3.5, the main difference being the 
need for the additional stopping time Tz- Note that this results in error terms depending 
on hrj + hz instead of /t^ only. For /i^ ^ {m+q)a^, the term a^r~ = 0{{m + q)a^) yields an 
error term of order h in the exponent, while for /i^ < (m + q)cr'^, it produces the prefactor 

gO((m+g)3/2a)_ □ 

Next, we need to control the stopping times and r^. Lemma 3.4 holds with minor 
changes, incorporating the z^-dependent terms. We find that 

Lemma 5.2. Let satisfy {^q ,X~ {yo, zo)^^^^) ^ h^- Then 



pO,(^o,0) 



^ 2 



sup ||?7u|| ^ 



Ae 



(5.5) 



X exp< —kq 



h^,il - 0{Ae)) 



i-o(x(^)(t).,(i + | + | + („. + .)^) 



The contribution of a'^/h'^ to the error term might be puzzling at first glance, but we 
will apply the preceding lemma for /i^ chosen proportional to /i 2> cr, so that a'^/h^ wih 
actually be negligible. 

The next result allows to control the stopping time r^. Let U^{t, s) denote the principal 
solution of £( = A'^{zf^^,yf^^,e)(, where A'^{z,y,e) = dzf^{z,y,£), and define 



X^z\t)= sup fi sup \\U\s,v)\\) du, 
Xf{t)= sup f( sup \\U\s,v)f)du. 



(5.6) 
(5.7) 
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Lemma 5.3. Let satisfy {^q,X {yo,zo) ) ^ Then 



deti 



^ 2 



Ae 
X exp< — Ko 



sup 

e/i2(i_o(Ae)) 



<r^X^z\t) 



(5.8) 



Proof: The proof is almost identical with the proof of Lemma 3.4 and Lemma 5.2, with 
a' replaced by o■/^/£ and V replaced hy U^. □ 



Below, we will choose hz proportional to h/s/e for ^ S> cr, so that the term {m+q)a^ /h^ 
becomes negligible. 



Proof of Theorem 2.8. We can repeat the proof of Corollary 3.10 in Section 3.2, 
comparing the process to different deterministic solutions on successive time intervals 
of length T. The only difference lies in new values for the exponents k^{0) (resulting 
from Proposition 5.1) and k' . In fact, choosing /i^ proportional to h, hz proportional to 
(1 + x'^\T)/s)^^^h and, finally, /x proportional to 1 + (/t^ + hz)/h, shows that 



»o,(€o-,.o,yo)| sup {^;,X~{ys,Zs)-'C) ^ h'} 



(r,£)e- 



(5.9) 



valid for all satisfying (.^q , X {yo, zq) ) ^ a h and all T of order 1 at most. Here 



Ae 



,Cl({m+(7)3/V) 



(1 - 27)("-<?)/2 
1 - - 0{A) -0(^(1 + ^^^Y^) ^) 



(2) 



Similar arguments as in the proof of Lemma 3.7 yield a bound of the form 
where 



K = 7 



l-0{A)-0{ 1 + 



X. 



(2) 



E)).)-o(^ 



^-2KoT/e 
27 



(5.10) 
(5.11) 

(5.12) 
(5.13) 



In order for the estimates (5.9) and (5.12) to be useful, we need to take T of order e. 

However, this leads to an error term of order 1 in the exponent k', which is due to the fact 
that has too little time to relax to the adiabatic manifold. In order to find the best 
compromise, we take T = Oe /\1 and optimize over 9. Assume we are in the worst case, 
when 1 1 [/"I I grows exponentially like e^+*/^. Then xh\T) is of the order e9e^^+^. The 
choice 



[^(1-27)] 



l/(2(Xo+A-+)) 



(5.14) 



yields an almost optimal error term of order h'^{l — 27)^ '^|log(/i(l — 27))!, with v = 

(2) 

Kq/{Ko + K^). The smaller K^, i. e., the slower Xz (t) grows, the closer u is to one. □ 
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5.2 The reduced system 

Given the SDE (5.1), we call 

dz" = 1/0(0, zl yl e) dt + 4^0(0, y°, e) d^, ^ ^ 

s (5.15) 

dyO = 5(0, zlvle) dt + a'G(0, z^yl e) AWt 

the reduced system of (5.1). It is obtained by setting ^'^ = 0. Let = (-2?,y?) and 
C,t = {zt — z^,yt — Ut)- Subtracting (5.15) from (5.1) and making a Taylor expansion of 
the drift coefficient, we find that {^^ ,Ct) obeys the SDE 

1 ^ (5.16) 

dCt = - [C{C?, e)C + B{C?, sKt + c(er , C*, (?,£)] dt + -^Gi^^ , Ct, Ct°, e) dWt, 

where ||6|| is of order + ||CIP + (m + g)a^, ||c|| is of order + ||CIP and ||^|| is 

of order ||^~|| + \\(\\, while is bounded. The matrices A~ , B and C are those defined 
in (2.69), (2.79) and (2.80). 

For a given continuous sample path {Ct{^)}t^o of (5-16), we denote by U^^ and V^; the 
principal solutions of = A~ {(f{uj),e)£,~ and £( = B{(^{oo),£)C. If we further define 

SUt,s) = - f\ut,u)C{C{u),e)U^{u,s)du, (5.17) 

^ J s 

we can write the solution of (5.16) as 

Ci(a;) = ^ r V,(t,s)g(C(c^),C.(^),C°(^),£)d^s(^) 

+ ^ / 5,(t,s)F(C(a;),G(a;),C°(^),£)dW,(a;) 
+ - /* V.(i, s)c(C (a;), G(a;), C°(a;), e) ds 

+ - fsUt,sM;{u;),Cs{co),C'si'^),^)ds. (5.18) 
Concerning the first two summands in (5.18), note that the identities 



Vu;it,s)=VUt,0)VUs,0)-\ 

SUt, s) = S^{t, 0)U^is, 0)-^ + VUt, 0)SUs, 0)-^ 



(5.19) 



allow to rewrite the stochastic integrals in such a way that the integrands are adapted 
with respect to the filtration generated by {M^sjs^o- 

We now assume the existence of a stopping time r ^ TB-{h) deterministic functions 
'&{t, s), '&c{'t: such that 



||V^(t,s)|| ^i?(t,5), 
|V,(t,s)C(C°(^),£)|U^?c(i,s), 
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uniformly in £, whenever s ^ t ^ t{lo), and define 

X^^{t)= sup - / ^s,uydu, i = l,2, (5.21) 

X^c\t)= sup - / ( sup i^c{s,vy)du, i = l,2. (5.22) 

O^s^t £ Jo ^u^v^s ' 

The following proposition establishes a local version of Theorem 2.9. 

Proposition 5.4. Let A 6e sufficiently small, fix times s < t such that t — s = Ae, and 
assume that there exists a constant i^o > such that '&{u,s) ^ i^o and '&ciu,s) ^ i^o, 
whenever u G [s, t] . Then there exist constants KQ,hQ > such that for all h ^ ho [x^^^ (i) V 

sup WCuW >h}^ 2e^"^+i^/'exp\-Ko^^ -— |. 

(5.23) 

Proof: The proof follows along the lines of the proof of Lemma 3.4, the main difference 
lying in the fact that the stochastic integrals in (5.18) involve the principal solutions Ui^, Vuj 
depending on the realization of the process. However, the existence of the deterministic 
bound (5.20) allows for a similar conclusion. In particular, the first and second term 
in (5.18) create respective contributions of the form 

g(m+q)/4 I Hi 1 .525) 

^ 16amfx'b\t)^ 

to the probability (5.23) . The third and fourth term only cause corrections of order hx^^^ (t) 
and hxc^ (t) [1 + (m + q)a^/h'^] in the exponent. Note that we may assume h"^ ^ {m + q)a'^ 
as well as h^ {m + g)(T^Xc'*(*)) because Estimate (5.23) is trivial otherwise. □ 

Now Theorem 2.9 follows from Proposition 5.4 by using a partition of the interval [0, t] 
into smaller intervals of length Ae. 
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